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Abstract 

We study the liberation process for projections: (p, q) i-> (pt , q) = {utpul , q) where u t is a unitary 
Brownian motion free from {p, q}. Its action on the operator-valued angle qptq between the projec- 
tions induces a flow on the corresponding spectral measures /it; we prove that the Cauchy transform 
of the measure satisfies a holomorphic PDE. We develop a theory of subordination for the boundary 
values of this PDE, and use it to show that the spectral measure fit possesses a piecewise analytic 
density for any t > and any initial projections. This demonstrates that the projections are al- 
ways rotated into general position for any t > 0, mirroring the finite-dimensional setting. As an 
application, we prove a special case of the Unification Conjecture for free entropy and information. 

1 Introduction 

Let V and W be subspaces of a the finite-finite dimensional complex space C d . If dim V and dim W are 
known, the minimal possible value of dim V R W is given by elementary linear algebra: dim V R W > 
maxjdim V + dim W — d, 0}. In fact, this inequality is almost surely equality: 

dim(L R W) = max{dim V + dim W - d, 0} a.s. (1.1) 

The almost surely can be interpreted in a number of ways: for example, it can be taken with respect 
to any reasonable probability measure on the (product) Grassmanian manifold of subspaces. We will 
discuss a more probabilistic interpretation shortly. 

When two subspaces satisfy the equality of Equation 1.1, they are said to be in general position. 
For convenience, we may rewrite this relation as follows. Let P, Q be the orthogonal projections onto 
V and W respectively, and let P A Q denote the orthogonal projection onto V R W. The dimension of 
a subspace is the trace of the projection onto it, so the subspaces are in general position if and only if 
Tr (P A Q) = max{Tr P + Tr Q — d, 0}. Normalizing, letting tr = ^Tr , two projections (i.e. subspaces) 
are in general position if and only if 

tr(PAQ) =max{trP + trQ- 1,0}. (1.2) 



In this language, a good way to express the genericity of general position is as follows: let P, Q be 
as above, and let U be a random unitary matrix. If P = UPU* (i.e. the projection onto the rotation of 
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the image of P by U), then P and Q are in general position almost surely. This statement is valid for 
all reasonable notions of random unitary matrix; for example, it holds true if U is sampled from some 
measure that has a continuous, strictly positive density with respect to the Haar measure. (Indeed, 
the subset of the Grassmannian product where general position fails to hold is a subvariety of lower 
dimension, and so it is easy to see that any reasonable measure will assign it probability 0.) Since 
any neighborhood of the identity has positive measure, it follows that rotations arbitrarily close to the 
identity produce projections in general position, agreeing with our intuition. Note: the same result 
applies as well even if P, Q are random projections, provided that the random unitary U is independent 
from {P,Q}. 

An important flow of random unitaries is given by the unitary Brownian motion. The group 
U (d) of unitary d x d matrices is a compact Lie group, and so its Riemannian metric (given by the 
killing form) gives rise to a heat kernel, which generates a Markov process: Brownian motion U t . This 
stochastic process can be constructed from the standard Brownian motion Wj on the Lie algebra Ud 
of Ud- it satisfies the (Stratonovich) stochastic differential equation dUt = Ut o dWt- The algebra 
consists of skew-Hermtian matrices; it is much more common to think of Brownian motion taking 
values in Hermitian matrices, so set B t = — iWt- Making this substitution, and converting the SDE to 
Ito form, gives 

dU t = iU t dB t - -U t dt. (1.3) 

(Note: this equation corresponds to the normalization Etr (B^) = t where tr is the normalized trace.) 
The distribution of Ut is the heat kernel at time t; on any Lie group, this has a strictly positive smooth 
density for any t > 0, cf. [27]. Hence, the above discussion proves the following. 

Proposition 1.1. Let P, Q be orthogonal projections on C d . Let Ut be the Brownian motion on the unitary group 
U (d), and set P t = UtPU^. Then for each t > 0, P t and Q are almost surely in general position. 

Beyond the question of general position (relating only to dimension), it is interesting to consider the 
relative position of the two subspaces, and how it evolves under the Brownian conjugation. The prin- 
ciple angles between P t and Q are encoded in the operator-valued angle QPtQ (whose eigenvalues are 
trigonometric polynomials in said angles). 

The purpose of the present paper is to address the question of general position, and more generally 
the nature of the flow of the operator- valued angle, in an infinite-dimensional context. Let s/ be a IIi- 
factor - a von Neumann subalgebra of the bounded operators on some Hilbert space "K, with a unique 
tracial state r. (For intuition, one may think of a limit as d — > oo of the algebra ofdxd random matrices; 
we will make the notion of such a limit precise in Section 1.1; the limit can always be identied as living 
in a free group factor.) Let p, q e s/ be projections, and let p A q be the projection onto p'K n q'K. Then 
p A q is in s/, since s/ is weakly closed. Say that p, q are in general position if 

r(p Aq) = max{r(p) + r(q) - 1,0}. (1.4) 

Instead of attempting to make sense of a random operator drawn from the unfathomably large group 
of unitaries on %, we use the tools of free probability (Section 1.1) to proceed. We may assume that s/ 
is rich enough to possess a free unitary Brownian motion ut (Section 1.3) free from {p, q}, by enlarging 
s/ if necessary. The operator-valued process u t can be thought of as a limit of the Brownian motions 
Ut in U(d) as d — > oo (cf. [3]). In [2, Thm. 8.2], the authors claimed to prove the following: If {p, q} 
are projections in a \\\-factor, and ut is a free unitary Brownian motion, freely independent from {p, q}, then 
Pt = utpiit and q are in general position for each t > 0. Unfortunately, their proof was flawed (as will be 
briefly discussed in Section 1.4). The present paper arose, in part, as an attempt to address this flaw. 
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The process t i->- (pt, q) (as t ranges through [0, oo)) is known as the free liberation of the initial 
pair (p, q). It was introduced in [29] as a technical tool for the analysis of free entropy and free Fisher 
information. As t — > oo, the free unitary Brownian motion u t tends (in the weak sense) to a Haar 
unitary operator. (This is the infinite-dimensional version of the statement that the heat kernel measure 
on the unitary group flows towards the Haar measure.) Thus, the pair (p t , q) tends towards (upu*,q) 
where u is a Haar unitary free from {p, q}. The two operators upu* and q are therefore free (cf. [20]), 
and so the spectral measure u of the self-adjoint operator q x l 2 upu* q 1 ! 2 = qp t q is given by the free 
multiplicative convolution of the spectral measures of upu* and q separately (cf. [31]). Let r(p) = a 
and r(q) = (3. Then p. upu * = n p = (1 — a)5o + a5\ while fi q = (1 — (3)5q + (38\. This convolution was 
calculated explicitly in [31, Ex. 3.6.7], via the Cauchy transform (cf. Section 1.1): setting u = p, p M u q , 
the authors show that 

f 1 n{dx) z + a + g - 2 - - 2(q + g - 2a(3)z + (a -~W c „ n « 

G - (z) = J — = 24^1) > - c +- 

The Stieltjes inversion formula (again see Section 1.1) then shows that 

a = (1 — min{a, (3})5o + max{a + (3 — 1, 0}<5i H ; lr r _ r i dx, (1.6) 

27TX(1 — X) 1 ' J 



where r± = a + (3 — 2a(3 ± 2-y^a/3(l — a)(l — f3) are the roots of the quadratic polynomial in the radical 
in Equation 1.5. It should be noted that this measure is precisely the limit of the empirical eigenvalue 
distribution of a Jacobi Enselmbe in random matrix theory, which has recently been much studied in 
part due to its applications to MANOVA (multivariate analysis of variance) problems in statistics, cf. 
[8, 9, 10, 11, 12, 14]. 

The general position rank shows up as the mass of the spectral measure \i concentrated at 1; this is 
no accident. By a theorem of von Neumann (cf. [32]), p A q is the weak lim n ^ ) . 00 (pq) n , and so 

r(pAq)= lim T[(pq) n ] = lim T[(qpq) n ] = lim / x n fj, qpq (dx) (1.7) 

which is precisely equal to n qpq {l}. The presence of the mass max{a + ft — 1, 0} at 1 in /x = ^ qup u*q 
therefore shows that upu* and q are in general position; that is, (pt, q) flow towards general position 
as t — > oo. Corollary 3.24, which follows directly from our main Theorem 1.5 below, shows that the 
same property holds for all t > 0, regardless of the initial distribution: there is a structural point mass 
of magnitude max{a + (3 — 1, 0} at 1, at least in the case a = (3 = |. 

In the present paper, we study the spectral measure fit = Hqp t q for all t > 0. As the calculation 
above shows, in the limit as t — > oo, the operator-valued angle qptq flows towards a universal form 
determined only by r(p) and r(q). For any t > 0, the law fi t completely determines the structure of the 
von Neumann algebra generated by pt and q, cf . [16, 17, 23]. Properties of this measure fa are important 
for the analysis of free entropy in [17]. The main theorems of this paper relate to the analysis of this 
flow. Since the limiting spectral measure u (the weak limit of \it as t — > oo) has a (structural) mass of 
1 — min{a, (3} at 0, it is sensible to remove this static singularity from the dynamics. The first major 
theorem of this paper shows that this produces a smooth flow equation for the Cauchy transform of 
Ut', and the Cauchy transform itself is analytic in both z and t. 

Theorem 1.2. Let p, q be projections in a ll\-factor, and let ut be a free unitary Brownian motion freely inde- 
pendent from {p, q}; set p t = u t pu\. Let r(p) = a and r(q) = f3, and let \x t denote the spectral measure of the 
operator-valued angle qptq- For t > and z in the upper half-plane C+, let 

G(t,z)= fM^ J-^fl ^ (lg) 
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Then the function G is analytic in both z e C + and t > 0, and satisfies the complex PDE 




d_ 

dz 



[z(z - 1)G 2 - {az + b)G] 



(1.9) 



where a = 2 min{a, /3} — 1 and b = \a — f3\. 

Remark 1.3. PDE 1.9 is qualitatively similar to the complex (inviscid) Burger's equation, which is a well- 
known example exhibiting blow-up in finite time with bounded initial data, as well as shock behavior. 
There is no reason to expect the stated analyticity result to follow from the PDE; in fact, our analysis 
will use tools from PDE, complex analysis, and from free probability to prove the a priori analyticity 
(in both variables) of the function G. 

Section 2 is concerned with the proof of Theorem 1.2. The idea is to use the free stochastic calculus 
of Section 1.2 to find a system of ODEs satisfied by the time-dependent moments T((qptq) n ) for n G N. 
These moments are the coefficients of the Laurent-series expansion of G in a neighbourhood of oo, and 
on this domain the ODEs combine to give PDE 1.9. Analyticity, and continuation to all of C+, is then 
a technical matter proved with careful estimates on the growth of derivatives given by the original 
ODEs and iteration of the PDE. 

Remark 1.4. The papers [1] and [9] consider similar situations, developing PDEs governing the flow 
of analytic function transforms of spectral measures associated to the free liberation of two operators; 
in both cases, the PDEs are compatible with Theorem 1.2 above. In [1], the two initial operators are 
assumed to be classically independent, hence commutative; in this case, the authors were able to solve 
the PDE explicitly: their solution is a dilation of the law of the free unitary Brownian motion u t , studied 
in [3]. The measure analogous to our p t is given meaning as a t-free convolution, yielding a continuous 
interpolation from classical independence to free independence. [9] considers a situation similar to 
ours, with the assumption that one projection dominates the other; hence, the scope is similarly less 
extensive than the general situation we presently treat. The benefit of these specializations is that they 
obtain explicit solutions in particular cases, as well as unexpected algebraic identities. 

In principle, the measure pt can be recovered from the function G(t, z) via the Stieltjes inversion 
formula (cf. Section 1.1). In practice, understanding the flow of the boundary values of the function 
from a PDE in the interior is a very difficult problem in partial differential equations. We are, as yet, 
unable to complete the analysis of the measure pt in general; however, in the special case a = (3 = \ 
(corresponding to a = b = in PDE 1.9), we have the following complete analysis. 

Theorem 1.5. Let fi t = \5q + v t be the spectral measure in Theorem 1.2 in the specal case a = /3 = \. Then for 
any t > 0, the measure v t possesses a continuous density p t on (0, 1). For any t > 0, there is a constant C(to) 
so that, for all t > to, 



Finally, the function p t is real analyic on the set {x € (0, 1) : pt(x) > 0}. 

Note that the bound on p t precisely reflects the asymptotic form the measure takes as t — > oo: in 
the case a = f3 = \, the Jacobi density of Equation 1.6 reduces to the (shifted) arcsine law (2tt)~ 1 (x(l — 
x))" 1 / 2 on [0, 1]; thus, we cannot expect any better behavior at the boundary of the interval. Theorem 
1.5 shows that the measure fit does not possess any mass at the endpoints: although it may blow up at 
x = 1, the singulary is milder than would reflect the Cauchy transform of a point mass. 

The smoothing results of Theorem 1.5 mirror similar properties of the so-called free heat flow: if p 
is a compactly-supported measure on M and at(dx) = -^- t \J (4t — x 2 ) + dx is the semicircular law of 
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variance t, then the free convolution fx EB a t possesses a continuous density which is real analytic on 
the set where it is > 0, cf. [5]. The techniques used to prove this theorem do not involve any PDEs, but 
are based on Biane's theory of subordination for the Cauchy transform. Motivated by those ideas, our 
approach to Theorem 1.5 is to develop an analogous theory of subordination for the liberation process: 
we show that, changing variables H(t, z) = yfz\fz — lG(t, z), the flow of H may be encoded by a 
deformation of the identity in the initial condition: H(t, z) = H(0, ft{z)) for a subordinate function f t 
which extends to a homeomorphism from the closed upper half-plane C + onto a domain Q t C C + . 
There is at least one striking difference between these results and those of [5], however. 

Theorem 1.6. Let p t be the spectral measure in Theorem 1.2 in the special case a = /3 = i. For any s,t > 0, 
supp p s and supp pt are homeomorphic. 

Thus support intervals of pt never collide; any two separated components remain separated for all 
time. This contrasts strongly with the measure flow in [5], where collisions do occur and can produce 
more severe singularities than existed at the boundary points before the collision. In the limit, pt 
converges to the Jacobi measure in Equation 1.6, which in the case a = j3 = \ is the shifted arcsine law 

\5 {dx) + p(dx), 

P( x ) = , n 1 [o,i] 0*0 
2it^x(l — X) 

whose support is the full interval [0, 1]. It follows that unsupported intervals for p t flow (at finite 
speeds, cf. Lemmas 3.2 and 3.3) into small neighborhoods of the endpoints {0, 1} as time increases, 
crowding and eventually disappearing in the limit. This is detailed in Section 3.3. 

Aside from the motivating question of general position for projections, the main application of the 
present results is to Voiculescu's theory of free entropy. In a (still ongoing) attempt to resolve the free 
group factor isomorphism problem, Voiculescu invented free probability as a way to import tools from 
classical probability, notably entropy and information theory, to produce invariants to distinguish von 
Neumann algebras. Motivated both by Shannon's original entropy constructions involving spacial mi- 
crostates, and more sophisticated constructions in information theory (using conjugate variables, for 
example), he introduced free analogues of entropy, Fisher's information, and mutual information of 
collections of non-commuting random variables, in a series of six papers from 1993-1996; the most 
relevant for our purposes is [29]. Classically, there are many relationships that hold between these 
information measures; the general question of proving the analogous relationships for their free coun- 
terparts is known as the Unification Conjecture in free probability. In Section 4, we briefly describe 
the precise form of the unification conjecture, and prove it in the special case of von Neumann algebras 
generated by two projections (of trace \). The main result is as follows. 

Theorem 1.7. Let p,q G (^,r) be two projections in a W\-factor, with trace r(p) = r(q) = \. Suppose 
that the operator-valued angle qpq has a spectral measure with a strictly positive density satisfying the bound of 
Equation 1.10. Then the free mutual information ofp, q and the free relative entropy ofp, q are equal: 

i*(P,q) = -Xproj{P,q) + Xproj(p) + XprojO?) = -Xproj (P, <?)• (1-11) 

The density bound is a technical assumption which is only needed in one very minor step in the proof; 
it is likely that the theorem stands for all pairs of projections. Moreover, the assumption on the traces 
is in place since it is required for Theorem 1.5. The extension of this theorem to the general case of the 
algebra generated by two projections will be discussed in a future publication. 

This paper is organized as follows. The remainder of Section 1 is devoted to the relevant back- 
ground for the rest of the work presented: Section 1.1 fixes the basic ideas and notation of free prob- 
ability; Section 1.2 discusses the free Ito calculus; Section 1.3 is devoted to the free unitary Brownian 
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motion that is central to this paper; and Section 1.4 describes the flaw in the proof of the general po- 
sition result [2, Thm. 8.2] that partly motivated the present work. The main results of this paper are 
in Sections 2-4: Section 2 is the proof of Theorem 1.2; Section 3 uses this result to develop local prop- 
erties of the measure \it, including the proof of Theorems 1.5 and 1.6; and Section 4 presents our main 
application, proving Theorem 1.7. 

1.1 Free probability 

Here we briefly record the basic ideas and notation used in the sequel; the reader is directed to the 
books [20] and [31] for a full treatment. The setting of free probability is a non-commutative probability 
space; we will work in the richer framework of a T4 / *-probability space r) where £/ is a von 
Neumann algebra, and r is a weak* -continuous, faithful traical state on &f. The elements in srf are 
called (non-commutative) random variables. The motivating example is the space si = Homc(C d ) 8> 
L°° (Q, Jp, P) of all random matrices with bounded entries (over a probability space (f2, J^", P)); here the 
tracial state is r = ^Tr^ (g> E, the expected normalized trace. Voiculescu's key observation was that this 
W* -probability space can be used to approximate (in the sense of moments) non-commutative random 
variables in infinite-dimensional von Neumann algebras, and classical independence combined with 
randomization of eigenvectors for random matrices gives rise to a more general independence notion 
modeled on free groups. 

Definition 1.8. Let (s/,t) be a W* -probability probability space. The *-subalgebras Ai,...,A n C ssf are 
called free or freely independent if, given any centered elements en G A\, r(aj) = 0, and any sequence 
h,i2, ■ ■ ■ ,im £ {I, • • • , n } with i k ^ i k+ i for 1 < j < m, T(a il a i2 ■ ■ ■ ctj m ) = 0. Two random variables 
a,b G sf are freely independent if the *-subalgebras they generate are freely independent. 

Free independence is a moment-factorization condition. For example, if a, b are freely independent, 
then T(a n b m ) = r(a n )r(6 m ) for any n, m G N, coinciding the classical independence of bounded ran- 
dom variables; for non-commutating variables, freeness includes more complicated factorizations such 
as r(abab) = r(a 2 )r(6) 2 + r(a) 2 r(6 2 ) — r(a) 2 r(6) 2 . Freeness is modeled on freeness in group theory. Let 
s/ = L(Ffc) denote the free group factor with k generators ui, . . . , u k . (L(¥k) is the von Neumann al- 
gebra generated by the left-regular representation of the free group on £ 2 (¥ k ); i.e. the von Neumann 
algebra generated by convolution on the free group.) Then subalgebras generated by disjoint subsets 
of the generators {ui, . . . , u^} are freely independent. 

If {{£? n ,T n ) : 1 < n < oo} are W* -probability spaces, and k G N, a sequence a n = (a\, . . . ,a^) G 
(ja4) fc is said to converge in distribution to a = (a 1 , . . . ,a k ) G if, for any polynomial P in k 
non-commuting indeterminates, T„[P(a n )] — > ^[^(a)] as n — > oo. That is: each mixed moment in a n 
converges to that moment in a. Such a sequence is said to be asymptotically free if its limit consists of 
random variables a 1 , . . . , a k that are freely independent. 

Proposition 1.9 ([20, 31]). Let X n and Y n benxn random matrices with all moments finite, and suppose that 
each has a limit in distribution separately. Let U n be a unitary matrix sampled from the Haar measure on U (n), 
independent from X n and Y n . Then (X n , U n Y n U*) are asymptotically free. 

Proposition 1.9 asserts that freeness is an asymptotic statement about the eigenvectors of random ma- 
trices: freeness means that their eigenspaces are independently, uniformly randomly rotated against 
each other. This result allows the realization of free random variables and stochastic processes as limits 
of random matrix ensembles; more on this in Section 1.2. 

If a G t) is a single self-adjoint random variable, it possesses a spectral resolution E a taking 
values in the projections of The composition r o E a produces a probability measure p, a on the 
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spectrum of a, known as the spectral measure of a; it is determined by the moments of a: 

r{a n ) = [ x n p a (dx). 



In general the non-commutative distribution of a random vector a £ srf k is the set of all traces of non- 
commutative polynomials in a; in the case of a single self-adjoint element (k = 1), these moments are 
encoded by the single measure p a , coinciding with the law of a classical random variable. In the case 
of a self-adjoint random matrix a, p, a is the average empirical eigenvalue distribution (the average of 
the random probability measure placing a point-mass at each eigenvalue of the matrix). 

Definition 1.10. Let pbe a compactly-supported finite positive measure on M. The Cauchy transform is 

the analytic function on the upper half-plane C + defined by 



G„(z) = / p(dx). (1.12) 

Jrz - X 

The function G a is analytic on C — supp p, but does not have a continuous extension across supp p. A 
generally useful uniform estimate for the Cauchy transform in the upper half-plane is 

\GM < (1.13) 

Another reason it is customary to restrict it to the upper half-plane is that the measure can be recovered 
from its action there, via the Stieltjes inversion formula: 

H(dx) = limSGUx + te). (1.14) 

7T (40 

The limit in Equation 1.14 is a weak limit: if / is a continuous test function, the integral of / against 
the e-dependent measure on the right converges to f f dp,; if p possesses a sufficiently integrable con- 
tinuous density, the limit is also true pointwise for the density. If p(dx) = p(x) dx where p £ L' P (M.) for 
some p £ (0, oo), then the Hilbert transform 

Hp(x) = - p.v. [ -?^L dy (1.15) 
7T J x-y 

is also in LP, and gives the boundary values of the real part of G^; that is 

limQGJx + ie) = p(x), — limKG M (x + ie) = Hp(x). (1.16) 

7T <e4-0 7T <40 

The Stieltjes inversion formula is also robust under vague limits of measures. 
Theorem 1.11 (Stieltjes continuity theorem). Let p n be a sequence of probability measures on !L 

(1) If p n — > p weakly, then G^ n —?■ G^ uniformly on compact subsets ofC + . 

(2) Conversely, if G^ n — > G pointwise on C+, then G is the Cauchy transform of a finite positive measure 
p, and p n — ^ p vaguely. If it is known a priori that G = G n for a probability measure p, then p n — ^ p 
weakly. 
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It is possible for mass to escape at oo in a vagule limit; for example, it is possible for G^ n — > point- 
wise. In our applications, the limit will be directly identified as the Cauchy transform of a probability 
measure. 

Freeness, which is a property of moments (hence of distributions), can be encoded in terms of 
the Cauchy transform. Given a compactly-supported probability measure /x, the ^-transform 8%^ is 
the analytic function on a neighborhood of the identity in the upper half-plane determined by the 
functional equation 

G li (& (t (z) + l/z)=z, zGC+, |*| small. (1.17) 

The ^"-transform in fact determines the Cauchy transform, by analytic continuation, modulo the con- 
straint limui^oo zG^(z) = fi(M) = 1. It is the free analogue of the (log-)Fourier transform: it linearizes 
freeness. 

Proposition 1.12 ([31]). Let a, b be self-adjoint random variables. Then a, b are freely independent if and only 
if 

®»*M 

for all sufficiently small z G C+. 

This highlights the fact that free independence yields a free convolution operation on measures. Given 
two (compactly-supported) probability measures fi, v, their (additive) free convolution /x EH v is the 
measure determined by Si^ v = + 8% v . That is: realize /j, and v as the laws of two free random 
variables a, b; then \i EH v is the law of a + b. 

There is a similar notion of (multiplicative) free convolution for positive operators. If a and b 
are free, then the law of a 1 / 2 6a 1 / 2 is denoted fj, a M /i^. It is determined through another analytic func- 
tion transform known as the ^-transform. For a compactly-supported probability measure \i, let 
Xn(z) = \G^(\) — 1 be its shifted moment-generating function, defined in a neighborhood of 0. The 
^-transform ^ M is the analytic function defined in a neighborhood of by the functional equation 

X» (i^r-W) = M smalL (1.18) 

Then ^^(z) = ^^{z)S^ v (z), as shown in [31]. It is through this operation that the free Jacobi law of 
Equation 1.6 was determined. 

1.2 Free Brownian motion and free stochastic calculus 

Let X n (t) denote an n x n Hermitian matrix- valued Brownian motion: the upper-triangular entries 
[X n (t)]ij,i < j are independent complex Brownian motions of total variance t/n; the diagonal does 
not matter to the limit. Then there is a W* -probability space r) in which the limit in distribution 
of any collection (X n (t{), X n (t2), . . . ,X n (tk)) for < t\ < ti < ■ ■ ■ < tk may be realized. For fixed t, 
X n (t) is a Gaussian unitary ensemble, whose limit empirical eigenvalue distribution is Wigner's semi- 
circle law at(dx) = 2it"vG^ — x2 )+ d% [33, 34]; hence the limits xt x , ■ ■ ■ , x tk are semicircular random 
variables. Mirroring the isonormal Gaussian process construction of Brownian motion, the limit may 
be realized in any W / *-probability space rich enough to contain an infinite sequence of freely indepen- 
dent identically distributed semicircular random variables (e.g. any free group factor). The limit xt is 
a non-commutative stochastic process with properties analogous to Brownian motion. 

Definition 1.13. A(n additive) free Brownian motion in a W* -probability space (g/, r) is a non-commutative 
stochastic process (x t )t>o with the following properties: 
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(1) The increments of xt are freely independent: for < t\ < £2 < ■ ■ ■ < tk, 
are freely independent. 

(2) The process is stationary, with semicircular increments: for < s < t, the law of xt — x s is at- s - 

(3) The £/ -valued function 1 1->- x t is weakly continuous. 

Free Brownian motion is the limit of matrix-valued Brownian motion; it can also be constructed as an 
isonormal process, or through a Fock space construction, cf. [6]. With the properties of Definition 1.13 
in hand, the standard construction of the Ito integral may be mirrored. If 9 t is a process adapted to xt 
(meaning that 6t is in the von Neumann subalgebra generated by {x s } s < t for each t > 0), then one can 
define the stochastic integral 

J 9 t dx t 

as an L 2 (£/, r)-limit of step functions of the form J2k ®t k (xt k — x t h _-i)- The relationship 4> t = f &t dxt is 
abbreviated as d(f> t = 9 t dxt- A (left) free Ito process yt is a stochastic process of the form 

y t = f 9 s dx s + [ 4> s ds (1.19) 
Jo Jo 

where 9t and 4>t are adapted processes; here J * 9 S dx s is short-hand for f 9 s t\o ,t]i s ) dx s and so forth. 
Evidently, a free Ito process is adapted. Equation 1.19 is usually written in the form 

dy t = 9 t dx t + (P t dt. (1.20) 

This stochastic differential notation is useful, and allows for succinct description of the rules of free Ito 
calculus. The most important is the free Ito formula which we will use in product form: 

d(y t zt) = (dy t )zt + yt{dz t ) + (dy t )(dz t ) (1.21) 

Here yt and z t are free Ito processes. For example, if dy t = 9 t dxt + 4>t dt and dzt = 9' t dxt + <p' t dt, then 

yt(dzt) = y t {9' t dx t + 4>'t dt) = (y t 0' t ) dx t + (yt<f> t ) dt 

while 

(dy t )z t = (6t dx t + 4>t dt)z t = 9 t dx t z t + (9 t z t ) dt. 

The processes are non-commutative, so we must be able to make sense of such terms; moreover, the 
product (dy t )(dzt) will contain mixed terms dxt dt and so forth. The rules, akin to standard Ito calculus, 
for these terms are as follows: 



dx t 9tdx t = T(9t)dt (1.22) 
dx t dt = dt dx t = (dt) 2 = (1.23) 

Equation 1.22 may seem counter-intuitive from classical Ito calculus: for a standard 1-dimensional 
Brownian motion B t , (dB t ) 2 = dt. It is easy to calculate, however, that for n x n matrix-valued Brow- 
nian motion X n (t) and an adapted matrix- valued process Y n (t), the Ito product-rule takes the form 
dX n (t) Y(t)dX n (t) = ^Tr (Y n (t)) dt. Therefore, Equation 1.22 follows from standard trace-concentration 
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results, cf. [6]. One final useful result is that free Ito integrals of adapted processes, like their classical 
cousins, are centered; i.e. 

r(0t dx t ) = 0. (1.24) 

The standard approach from classical stochastic calculus (using the Picard iteration technique, for ex- 
ample) can then be used to solve free stochastic differential equations of the form 

dy t = a(t, y t ) dx t + b(t, y t ) dt (1.25) 

for sufficiently smooth functions a, b: M + x srf — > srf . The reader is referred to [6, 7] for details. 

Remark 1.14. As the calculations following Equation 1.21 demonstrate, the product of two left free Ito 
processes is not, in general, a left free Ito process: it may contain terms like dxt 9t in differential form, 
which is not equal to 6 t dx t . A proper treatment of free Ito calculus should be formulated in terms of 
biprocesses t ^ uj t ^ ® that can act on the left and the right simultaneously. Biane and Speicher 
develop this theory in [6]; the corresponding stochastic integral is denoted 

J u t %dx u 

defined as an L 2 -limit of sums of the form J2k ®t k (%t k — x t k _ l )4>t k where ut is approximated by J2k ®t k ® 
4>t k . We have described free Ito calculus in slightly imprecise terms to avoid the new notational com- 
plexity; for our purposes, the more general theory is essentially unnecessary. The interested reader can 
find a careful overview in [19]. 



1.3 Free unitary Brownian motion 

Introduced in [3], the free unitary Brownian motion is the solution to the free Ito stochastic differential 
equation 

dut = iut dxt — —Ufdt (1-26) 

with initial condition uq = 1; here, as usual, xt is a(n additive) free Brownian motion. It would, 
perhaps, be most accurate to call the solution of this free SDE a left free unitary Brownian motion. To 
be sure, note that the adjoint u* t satisfies 

dv* t = (du t )* = —idxtv* t — -u* t dt (1.27) 

since x t is self-adjoint. Equation 1.26 is the exact free analogue of Equation 1.3 for the finite-dimensional 
unitary Brownian motion. Indeed, this comes from the fact that u t is the limit in distribution of the 
Brownian motion on the unitary group, in the same sense that x t is the limit in distribution of the 
Hermitian-matrix valued Brownian motion. It is a unitary-valued stochastic process, whose distribu- 
tion is a continuous function on the unit circle for each t > 0. Biane calculated the moments of this 



measure: 



r- \j + 1 

j— u 

Using these moments, it is possible to compute an implicit description of the density of fi Ut (with 
respect to the uniform probability measure on the unit circle); for t < 2, the measure is supported in a 
strict symmetric subset, and then achieves full support for t > 2. 

Analogous to Definition 1.13, and in line with the properties of the Brownian motion on the unitary 
groups, the free unitary Brownian motion has the following properties, which can be derived directly 
from the stochastic differential equation 1.26. 
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Proposition 1.15 ([3]). Let v t be the measure on the unit circle possessing the moments on the right-hand-side 
of Equation 1.28. Then the free unitary Brownian motion satisfies the following properties. 

(1) The multiplicative increments ofut are freely independent: for < t\ < ti < ■ ■ ■ < t^, 

u t2 u* tl , u ts u* t2 , u^u*^ 

are freely independent. 

(2) The process is stationary: for < s < t, the law of the unitary random variable utu* is u t - s . 

(3) The &f -valued function 1 1->- ut is weakly continuous. 

Comparable statements can be made about the right free unitary Brownian motion u% . 

1.4 The Flaw in [2, Thm. 8.2] 

The final theorem in the first author's paper [2] claimed to prove the motivating theorem of the present 
paper: if p, q are projections in a W* -probability space {&/ , r), and if and Ut is a free unitary Brownian 
motion, freely independent from {p, q}, then p t = utpu\ and q are in general position for all t > 0. The 
idea of the given proof was as follows. First, by making replacements p «->■ 1 — p and q <-> 1 — q if 
necessary, it suffices to prove the theorem in the case r(p), r(q) < \, in which case the general position 
statement is that r(p t A q) = 0. Let r t = pt A q, and define a function F t : R + — > R by 

F t(s) = r[(r t p s rt - r t ) 2 }, s > 0. 

The function F t is non-negative. Since r t is a projection onto a subspace of the image of p t/ r t p t r t = p t 
and so F t (t) = 0; hence F t (s) has a minimum at s = t. The claim is then made that the function F t is 
differentiable, and so F[(t) = 0. Free Ito calculus is then applied to calculate the derivative; it is then 
claimed that F[(t) < — r(r t ) 2 r(p). Thus, F[(t) < unless r(r t ) = as required. 

There are two flaws with this argument. The first, fairly subtle, is the assumption of differentiability. 
The stochastic process s H> (r t p s r t — r t ) 2 is manifestly not adapted to the filtration generated by (u s ) s >q\ 
for s < t, it depends on the value of u t . It is only adapted for s > t, in which case the tools of Ito 
calculus indeed apply as stated. Thus, it was correctly demonstrated that the function F t (s) has a 
minimum at s = t, and is differentiable for s € [t, oo) (meaning right-differentiable at s = t). If it were 
indeed true that a' t (t) < —T(r t ) 2 T(p), the argument would remain valid: a right-differentiable function 
cannot have a minimum at a point where the right-derivative is strictly negative. Unfortunately, there 
was also a calculation error in the determination of this derivative. In fact: 

Proposition 1.16. The right-derivative of F t satisfies F[(t) = 2r(rt)(l — r(p)) > 0. 

This produces no contradiction to the possibility that r(r t ) > 0, however: since the function F t (s) is not 
known to be differentiable in a neighborhood of s = t, its right-derivative may well be strictly positive 
at the minimum (with behavior akin to the function si—)- \s — t\, for example). The proof of Propostion 
1.16 is delayed until Section 2.2. 

It is possible this proof could be mended using a free version of stochastic calculus for non-adpated 
processes, which has yet to be developed. In the classical case, such techniques are based on the 
Malliavin calculus [21], which does have an analogue in the world of free probability, developed in 
[6, 7] and further developed in the second author's recent paper [19]. Nevertheless, even if such a non- 
adapted calculus were applicable and mirrored the classical behavior, it is likely the function F t (s) 
could still not be proved differentiable at the point s = t, but only on the complement of this point in 
R+. Thus, to prove the general position claim, fundamentally different techniques are required; this is 
part of the impetus for the present paper. 
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2 The Flow of the Spectral Measure m 

2.1 The flow of moments 

Let p, q G srf be projections with r(p) = a and r(q) = (3. Let Ut be a free unitary Brownian motion, free 
from p, q, and (as usual) define pt = u t pu* for t > 0. Our present goal is to understand the moments 

9n(t) = T[(qp t q) n ] n>l,t>0. (2.1) 

We will use stochastic calculus to derive a system of ODEs satisfied by the functions g n on (0, oo). To 
that end, we need a generalization of the I to product rule of Equation 1.21 to products of many adapted 
processes. An easy induction argument shows that if a\(t), ... , a n (t) are adapted processes then 

n 

d(ai ■ ■ ■ a n ) = ^2 a± ■ ■ ■ a-j-i daj Oj+i • • • a n + ^2 a± ■ ■ ■ aj_i dai aj+i • • • Oj_i cfaj Oj+i ■ ■ ■ a„. 

j'=l l<i<j<n 

(The induction follows from the fact that dai daj da^ = for any k; this follows from repeated 
applications of the rules of Equation 1.23.) Specializing to the case a\ = ■ ■ ■ = a n = a, we have 

n 

d(a n ) = a j_1 da a n ~ j + ^ a*" 1 da a?- ' 1 ' 1 da a n ~i (2.2) 

j=l l<i<j<n 

(Note: Equation 2.2 only makes sense for n > 2.) We will apply this to the adapted process at = qptq, 
and then take the trace to find dg n (t). This affords immediate simplifications: 

n 

r[d(a n )} = ^ ^K" 1 da a n ~ j ] + ^ r^" 1 da a?' 1 ' 1 da a n ~ j ] 

j=l l<i<j<n 
n 

= J2 r[a n ^ da] + T [a n -^- 1 da a^' 1 da] 

j=l l<i<j<n 

where we have used the trace property and combined terms on the left. The terms in the first sum 
do not depend on the summation variable j, and so this simply becomes nrfa"" 1 da]. In the second 
summation, the summands depend on the summation variables i,j only through their difference k = 
j — i which ranges from 1 up to n — 1. We therefore reindex by this variable, and that sum becomes 

n-l 

Y T[a n ^ j ^~ 1 daa^ j ~ i ^ 1 da] = Y^ Yl r K^" 1 daa^ 1 da]. 

1<*<J<™ k = l l<i<j<n 

j — i = k 

For fixed k E {I, ... ,n — I}, the number of pairs with j — i = k is equal to n — k, and so this 
summation S becomes 

71-1 

5 = J^(n - k)T[a n - k ~ l daa k ~ l da]. (2.3) 

k=l 

Reindexing j = n — k shows that this sum is also given by 

n-l 

S = ^irK" 1 daa n -3~ l da]. (2.4) 
i=i 
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Using the trace property, adding Equations 2.3 and 2.4 gives the simplification 

n— 1 n—1 

2S = J^(n - j + j) rla^- 1 da a^ 1 da] = n ^rja^'- 1 da a^ 1 da}. 

Thus, we have 

n-l 



r[d(a n )] = iiTfo^ 1 da] + ^ t^- 1 aW" 1 da]. (2.5) 

Now, at = qptq and so (applying the Ito product rule 1.21 twice) da t = qdp t q. To evaluate the differen- 
tial dpt, the product rule again gives 

dp t = d(u t pu*) = (du t )pu* t +u t [d(pu* t )} + {du t )[d(pu* t )}, 

and since p is constant with respect to time, 

dp t = {du t )pu* t + u t pdu* t + {du t )p{du* t ). 

We now substitute the stochastic differential equations 1.26 and 1.27 to express dp t as 

dp t = {iu t dx t - -u t dt)pu* t + u t p(-idx t u* t - ^u* t dt) + (iu t dx t - ^u t dt)p(-idx t u* t - ^v* t dt). 

The first two terms simplify to give 

iut dxtpu* t — iutpdxt u* t — utpv* t dt. 
The final term has only one surviving factor: by Equation 1.22 

(iut dxt)p{—idx t u* t ) = utdxtpdxtu* t = utr{p)v* t dt = r{p) dt. 

Altogether, then, we have 

dpt = —u t pv* t dt + iu t dxtpu* t — iutpdxt u* t + r{p) dt. (2.6) 

Recalling that r(p) = a and that utpu\ = pt, the first and last term combine to (a — pt) dt. For the 
middle terms, it is useful to introduce a new process. Let 

dy t = iu t dx t u* t . (2.7) 

While it is easy to see that this SDE has a unique solution y t (satisfying yo = 0), we need not concern 
ourselves with this fact. Indeed, all that is important is that the rules (Equations 1.22 and 1.23) of Ito 
calculus applied with y t instead of xt take the following form. If Q t is an adapted process, then 

dyt Qt dy t = (iu t dx t u*)6 t {iu t dx t u*) = -u t T(u*6 t ut)u* t dt = -r(0 t ) dt (2.8) 

and 

dy t dt = dt dy t = {dt) 2 = 0. (2.9) 

Now, Equation 2.6 can be rewritten as 

dp t = (a-p t )dt + dy t p t - Pt dy t . (2.10) 
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Thus, with at = qpt.q, it follows that 

da t = qdp t q= (aq - a t ) dt + q dy t ptq - qpt dyt q. (2.11) 
We now simplify the first term in Equation 2.5. 

a™ -1 da t = a^~ 1 (aq - a t ) dt + a™ -1 qdy t p t q - a%~ x qp t dy t q. 
By Equation 1.24, the last two terms have trace 0, and so we simply have 

r(a™~ 1 da t ) = r[a£ _1 (a? - a t )] dt = [ar(a™- 1 ) - r(a?)] dt (2.12) 

where we have simplified a1~ 1 q = (qptq) n ~ l q = (qptq) nl = since q 2 = q. 

For the second term (the summation) in Equation 2.5, it is convenient to make yet another transfor- 
mation. Define zt by 

dz t = q dy t p t q - qpt dy t q. (2.13) 

Again, one can use standard theory to show that there is a unique adapted process with zq = satisfy- 
ing this SDE, but this is not important for present considerations. The following lemma expresses the 
form of the Ito calculus in terms of the process zt- 

Lemma 2.1. Let z t be defined by Equation 2.13. Then dztdt = dtdzt = (dt) 2 = 0, and if 9 t is an adapted 
process, then 

dz t 6 t dz t = [-2T(a t 9t)at + T{a t d t )q + T(q6 t )at] dt. (2.14) 

Proof. Since zt is a stochastic integral, the Ito rules regarding product with dt apply as usual. For 
Equation 2.14, we simply expand 

dz t 9 t dz t = (q dy t p t q - qpt dy t q)0 t (q dy t p t q - qpt dy t q) 

= q dy t {ptqOtq) dy t p t q - q dy t (ptqOtqPt) dy t q - qpt dy t (qO t q) dy t p t q + qpt dy t (qO t qpt) dy t q. 

Applying Equation 2.8 to each of these four terms yields 

-q ■ T(p t q0tq) -Ptqdt + q- T{p t q6 t qpt) -qdt + qpt- T(q6 t q) ■ p t q dt - qp t ■ T(qO t qp t ) ■ q dt. 

Using the trace property, and simplifying with the relations q 2 = q,p\ = pt, and qptq = at, yields 

dz t t dz t = -T(a t 9t)a t dt + T(a t t )qdt + T(q6 t )a t dt - T(a t 9 t )a t dt, 

which simplifies to give Equation 2.14. □ 

Refer now to the summands in the second term in Equation 2.5. From Equations 2.11 and 2.13, we 
have 

dat = {otq — at) dt + dz t . 

Hence, for j e {1, . . . , n — 1}, 

oj 1- - 7-1 da t a{~ 1 dat = <h~^~ l — a t) dt + dzt] a{~ 1 [(aq — at) dt + dz t ]. 

We may expand this into four terms. However, since dt commutes with everything and, by Lemma 2.1 
dt dzt = 0, and as always (dt) 2 = 0, the only surviving term is 

a™ _J_1 dat <it~ dat = a™ - "' -1 dzt ai' 1 dzt- 
14 



Employing Equation 2.14 of Lemma 2.1, we therefore have 



n-j-l 



data{ 1 dat = a™ 3 L [—2T{al)a t + T{al)q + T{qa{ l )a t ]dt 



n—j—1 



3-l> 



Taking the trace, we have 



r(a 



n-j-l 



dotal l da t )= -2r(a^)r(o^ J ) + r(^)r(a™ J i g) + r(ga^ i )r(a™ J ) 



n-j-l. 



dt. 



(2.15) 



Provided j — 1 / and n — j — 1 / (i.e. n > 4 and j G {2, . . . , n — 2}), ga^ 1 = a{ 1 and ga£ 



n-j-l 



n-j-l 



, as per the discussion following Equation 2.5. So, in this regime, we have 



t(ci 



n-j-l 



da t a J t 1 da t )= -2r(a|)r(a t ri J )+r(o^)r(a™ J L ) + T(aj i )r(a™ J ) 



dt, 2 < j < n - 2. 

(2.16) 

The case j = 1 corresponds to r(a™ 2 dat dat), while j = n — 1 corresponds to r(dat a n ~ 2 dat). By the 
trace property, these are equal. In each case, one of the g-terms is r(a™~ 2 g) = T(a™~ 2 ), while the other 
is r(a° q) = r(q) = (3. So, we have 

r(a r ^- Y da t a{~ 1 da t ) = [-2T{a t )T(a^ 1 ) + T(a t )r(a^ 2 ) + ^(a^ 1 )} dt, j € {1, n - 1}. (2.17) 

Hence, the second term in Equation 2.5 is equal to 



n[-2r(a t )T(a n t - 1 ) + r(a t )T(a^ 2 ) + /3r(at )] dt 

1 71-2 

+-nJ2 [-2r(4)r(ar j ) + ^M^' 1 ) + T^Ma^ )] dt 



(2.18) 



i=2 



where the summation is taken to be in the case n < 3. Combining Equations 2.12 and 2.18 with 
Equation 2.5, and using the notation g n (t) = r(af), we therefore have 



dg n = T[d(a n )} = [nag n -i -ng n - 2ng x g n -i + ngig n - 2 + n(3g n ^i]dt 



n-2 



+ \ n ^Z \.~ 2 9j9n-j + gjg n -j-i + gj-ig n -j] dt. 



(2.19) 



i=2 



From here we see that g n is actually differentiable, and that Equation 2.19 is a differential equation. 
One more simplification is in order. By making the change of index k = n — j, note that 



71-2 



n-2 



^2/9j9n-j-l = /,9n-k9k-l- 
j=2 k=2 

Hence, the second and third summands in the second line of Equation 2.19 have the same sum, and 
we have 



n-2 



9n 



-n 



g n + n(a + fi)g n -i - 2ngxg n -\ + ngig n - 2 + -n^[-2g 3 g n _j + 2g j ^ 1 g n _ j 



(2.20) 



i=2 
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We can then recombine the third and fourth terms (just before the summation) as follows: 



n-2 



2ngig n ^i + ngig n - 2 + ^^[-2ffj g n -j + 2g j ^ig n - j ) 



3=2 



n-2 



n-2 



-ngig n -i - 9j9n-j - ns n _isi + n gi g n „ 2 + n 

9j — l9n- 

3=2 J \ 3=2 

n—1 n— 1 

= ~ n Yl 9 3 9n-i + n Yl 9j-l9n-j ■ 
3=1 3=2 

Combining this with Equation 2.20, we are led to the following result. 

Proposition 2.2. Let {g n : n > 1} be defined as in Equation 2.1, and set go = 1. Then g n is continuous 
on [0, oo) and differentiable on (0, oo) for each n. Furthermore, the functions g n satisfy the following infinite 
system of ordinary differential equations. 



9i = -9i + a/3 



92 



-2g 2 + 2(a + f3) 9l - 2g\ 



n-l 



n-1 



-ng n + n(a + P)g n -i - n^Jffjg n _j + n gj_ig n _j, n > 3. 

3=1 3=2 



(2.21) 
(2.22) 

(2.23) 



Note that similar equations appear in [9] (see also [15, 29]). It will be convenient to define go = a + (5. 
With this convention, Equations 2.22 and 2.23 can be written in the more compact form 



9n = ~n 



n-l 

9n + ^2(9j -9j-l)9n-j 

3=1 



n> 2. 



(2.24) 



2.2 The Proof of Propostion 1.16 

Our goal here is to calculate the right-derivative of F t (s) = r[{r t — r t p s rt) 2 ] at s = t. The process 
s h> (rt - r t p s rt) 2 is adapted for s £ [t, oo), and so we may use the tools of Ito calculus to compute this 
derivative. To begin, we expand 

(r t - r t p s r t ) 2 = r 2 - r 2 p s r t - r t p s r 2 + (r t p s r t ) 2 = r t - 2r t p s r t + r t p s r t p s r t . 

Recalling that t is constant, the stochastic differential is 

-2r t dp s r t + d(r t p s r t p s r t ). 

Using the Ito product rule 1.21, the second term expands to 

d[(r t p s rt)(p s rt)] = (r t dp s rt)p s rt + r t p s rt(dp s rt) + d(r t p s r t ) ■ d(p s r t ) 
= r t dp s r t p s r t + r t p s r t dp s r t + r t dp s r t dp s r t . 

Combining and taking the trace, this yields 

dF t (s) = r[d(r t - r t p s r t ) 2 ] = T[-2r t dp s r t + r t dp s r t p s r t + r t p s r t dp s r t + r t dp s r t dp s r t ]. 
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Using the trace property (and the fact that r t = r 2 ), this simplifies to three terms: 

dF t (s) = -2T(r t dp s ) + 2T(r t p s r t dp s ) + T(r t dp s r t dp s ). (2.25) 

From Equation 2.10 above, we have dp s = (r(p) — p s ) dt + dy s p s — p s dy s where the process y s is 
determined by fSDE 2.7, and obeys the ltd calculus of Equations 2.8 and 2.9. 

We consider now the three terms in Equation 2.25 separately. First: 

T(r t dp s ) = [-T(r t p s ) + r{p)r t ] ds + ir(r t dy s p s ) - ir(r t p s dy s ). 

The last two terms are 0: each can be expressed in the form t(9 s dx s ) where 6 S is adapted (since s > t), 
and I to integrals are centered. Thus 

r(r t dp s ) = [-T(r t p s ) + r(p)r(r t )] ds. (2.26) 

Now for the second term in Equation 2.25. Combining with Equation 2.10, we have 

T(r t p s r t dp s ) = [-T(r t p s r t p s ) + T(r t p s r t )T(p)] ds + iT(r t p s r t dy s p s ) - iT{r t p s r t p s dy s ). 

Again, the last two terms are of the form t{9 s dx s ) for adapted 9 S , and so we have 

T(r t p s r t dp s ) = [-T((r t p s ) 2 ) + T(r t p s )\ ds. (2.27) 

Finally, we come to the third term in Equation 2.25. We must calculate the trace of 

{r t dps) 2 = ([-r t p s + T{p s )r t ] ds + r t dy s p s - r t p s dy s ) 2 . 

All products involving the ds term vanish, since ds 2 = dsdy s = (cf. Equation 2.9). Thus, we simply 
have 

(r t dp s ) 2 = (r t dy s p s - r t p s dy s ) 2 

= (r t dy s p s - r t p s dy s ) 2 = r t dy s p s r t dy s p s - r t dy s p s r t p s dy s - r t p s dy s r t dy s p s + r t p s dy s r t p s dy s . 
Now using Equation 2.8, these terms simplify as 

• r t dy s p s r t dy s p s = -T(p s r t )r t p s ds 

• r t dy s p s r t p s dy s = -r(p s r t p s )r t ds = -r(p s r t )r t ds 

• r t p s dy s r t dy s p s = -r t p s T(r t )p s ds = -T(r t )r t p s ds 

• r t p s dy s r t p s y s = -T(r t p s )r t ps ds 
Summing and taking the trace we have 

T[(r t d Ps ) 2 ] = [-2(r(p s r t )) 2 + 2r{r t )r{p s r t )} ds. (2.28) 

Combining Equations 2.26, 2.27, and 2.28 with Equation 2.25, we have 

dF 

-±(s) = -2[-r(r tPs ) + T{p)T{r t )\ + 2[-r({r tPs ) 2 ) + r(r t p s )] + [-2(r(r t p s )) 2 + 2r(r t )r(r t p s )] 
ds 

= 2 (2r(r tPs ) + r(r t )[r(r tPs ) - r(p)] - [r((rp s ) 2 ) + (r(rp s )) 2 ]) . 
Evaluating at s = t, we get (using r t = r 2 = r t pt) 
1 dF 

-^(t) = 2r{r t p t ) + r{r t )[r{r t p t ) - r(p)] - [r((rp t ) 2 ) + {r{rp t )?\ 

= 2r(r t ) + T(r t )[T(r t ) - r(p)} - [r(r t ) + r(r t ) 2 ] 
= T(r t ) - T(r t )T(p) = r(r t )(l - r(p)) 
as claimed in the corollary. □ 
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2.3 Smoothness of the Moment Generating Function in t 

Our next goal is to prove that the (centered) moment-generating function of fit 



(2.29) 



n>l 



is C°° jointly in (t, w) for t > and \w\ < 1. The moments <? n (i) are solutions to Equations 2.21-2.23, 
which can, in principle, be solved explicitly. Presently, we only use the fact that the solution has a 
simple analytic form. 

Lemma 2.3. For n > 1, the function g n (t) ofPropostion 2.2 is a polynomial in t and e~*. 

Proof. To begin, we may solve Equation 2.21 explicitly: gi(t) = gi(0)e~ l + a/3(l — e _i ), having the 
desired form. We proceed by induction on n. Equations 2.22 and 2.23 give, for n > 2, g' n + ng n = h n 
where h n is a polynomial in g±, . . . , <? n _i, and is therefore a polynomial in t and e - * by the induction 
hypothesis. The ODE can then be written in the form ^[e nt g n {f)} = e nt h n (t), with solution 



9n(t) 



- at 



e ns h n (s)ds + g n (0) 



Since h n (s) is a polynomial in s and e~ s , e ns h n {s) is a polynomial in s and e ±s whose positive degree 
in e s is < n. A separate induction argument and elementary calculus show that the antiderivative of 
e ns h n (s) is therefore also a polynomial in s and e ±s whose positive degree in e s is < n. Thus g n (t) has 
the desired form, proving the corollary. □ 

We can also iterate the ODEs 2.21-2.23 to find a general recurrence form for the kth derivatives. 

Lemma 2.4. Let {g n }n>i be defined as in Equation 2.1, with go = a + j3 as in Equation 2.24. Then for allt > 
and n, k > 1, the kth derivative g^ satisfies 



9^ 



fc+i 

Yl Yl c Ui,---,js)h jl ---h js 

S=0 l<Jl,...,Js<Jl 



(2.30) 



,3s)\ < n s 



where each function hj is one of±gifor < i < n, or a constant < 1 in modulus, and \c(ji, 

(k) 

Proof. When n = g = 0; when n = 1, iterating Equation 2.21 shows that g\ - 
which has the given form (since a/3 < 1). For n > 2, we proceed by induction on k, with the base case 
provided by Equation 2.24, which we repeat here for convenience: 



-l) k [ 9l - af3], 



9n 



-n 



n-l 

9n + ^(ffj - 9j-l)9: 



n-j 



n>2. 



(This takes the form given in Equation 2.30, where the s = term is 0, for s = 1 only the single 
term g n survives with constant c(n) = —n, and for s = 2 the non-zero terms have c(j, n — j) = n, 
c(j — 1, n — j) = — n for 1 < j < n, and all other c(j\, 22) = 0.) Now taking Equation 2.30 as the 
inductive hypothesis, we compute 



(fc+i) 



k+l 

s=0 1<j'i ,js<n 



1 3 s 



dt 



.n 



(2.31) 
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The product rule gives the interior derivative as a sum of s terms which, after relabeling, each take the 
form h'jjife • • • hj s . By the inductive assumption, hj x is either constant or ±gi for some 1 < i < n; hence 
its derivative is either (if hj 1 is constant, including the case i = 0) or, by Equation 2.24, 



9i + Y?e=i(9e ~ 9t-\)9 



i-i 



if i > 2 



-gi + a/?, if i = 1. 



Thus h'^h^ • • • hj a is a sum of terms of the form 0, or otf3hj 2 ■ ■ ■ hj s , or —g\hj 2 • • • hj s , or =figihj 2 ■ ■ ■ hj s , or 
zs r'i'9i9i-ihj 2 ■ ■ ■ hj s , or finally ±.igg_\gi_(hj 2 ■ ■ ■ hj s . Each of these is a scalar multiple of a product of s — 1, 
s, or s + 1 functions hj of the correct form. Combining these into the interior sum in Equation 2.31, we 
produce terms with between 1 and s + 1 h^s, with coefficients of the form either 0, or a(3c(ji, . . . , j s ), 
or — c(ji, . . . , j s ), or ±ic(ji, . . . ,j s )- Since \c{j\, . . . ,j s )\ < n s by assumption and 0, a/3, l,i < n, these 
new coefficients have absolute value < n s+1 , confirming the desired inductive form and completing 
the proof. □ 

Next, we use the equations to prove the following family of (blunt) growth estimates for the deriva- 
tives of the moments g n (t)- 

Lemma 2.5. Let {g n }n>i be defined as in Equation 2.1. Then for each k G N, the kth derivative g^ is uniformly 
bounded by 

\ g W\ < (k + 2)2 k+1 n 2k+1 . 

Proof. For all n > 1, g n {t) is the nth moment of a probability measure fit supported in [0, 1]; hence 
|<?n(i)| < 1- By the convention used in Lemma 2.4 go = a + f3 which is in [0,2]. Hence, in general 
|<7n(i)| < 2 for all n > 0. Thus, referring to the recursive form of the derivative given in Equation 2.30 
in the previous lemma, we have 

fc+i 

\9 { n ] \<Yl Yl \<h,---,js)\\h jl \---\h js \ 
s=0 l<Jl,...,.Js<W 

where each hj is either a constant < 1 in modulus or ±gi for some < i < n; thus we also have \ hj\ < 2 
for all j. The lemma also asserts that the coefficients \c(j\, . . . , j s )\ < n s < n k+1 , and so 



fc+i fc+i fc+i 

\g { n k) \ <n k+1 Y, E 2 s <2 fc+ V +1 ^ l = 2 fc+ V +1 ^n s < (A: + 2)2 

s=Q l<ji,...,j s <n s=0 l<j\,...,j s <n s=0 



fc+l n 2fc+l 



as desired. □ 

Remark 2.6. A more careful estimate is possible, showing that \gr^\ < Cfcn 2fc for some exponentially 
large constant c^; for example, direct estimation of Equation 2.23 gives 

(re— 1 re— 1 

bnl + (a + P)\g n -i\ + E \9j\\9n-j\ + E \9j~i\\9n-j\ 
3=1 3=2 

= n (1 + a + [3 + (n - 1) + (n - 2)) < 2n 2 

smaller than the 12n 3 bound proved in above. The improvement this bound represents over the bound 
proven in Lemma 2.5 is of no consequence to our application, however. 
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We now prove the main theorem of this section: that the (centered) moment-generating function 

t/)(t, w) = Yl n >i 9n(t)w n is C°° in both variables. 

Proposition 2.7. The function tp(t, w) of Equation 2.29 is C°° jointly in both variables, for t > and \w\ < 1. 

Proof. Since the coefficients g n {t) are uniformly bounded in modulus by 1, the power series ijj(t, •) 
converges uniformly on compact subsets of D, and analyticity is an elementary result from complex 
variables. Now, for k > 1 and m > define 

oo 

V>k,m(t, w) = 9 { n\t) ■ n{n - 1) • • • (n - m + l)w n ~ m . 

n=m 

Throughout this proof (only), we use the non-standard convention that 0° = 0; thus 990,0 = ip- For fixed 
t > 0, by Lemma 2.5, the coefficients of this power series in w are bounded by 

\g { r k \t) ■ n(n - 1) ■ ■ ■ (n - m + 1)| < (k + 2)2 k+1 n 2k+1 ■ n rn 

and so by the root test, the power-series converges uniformly on compact subset of the unit disk D, 
defining a function analytic of w. Similarly, for fixed < 1, ifk,m(t, w) is absolutely summable; it is a 
series of functions that are continuous (by Lemma 2.3) and hence, by the Weierstrafi M-test, tpk,m(t, w) 
is continuous in t. This shows that (fk,m 

The uniform convergence of <pk m and pointwise convergence of <pk-i,m ( a t an y single point in D) 
imply (for example by [24, Thm. 7.17]) that ip^ m (-,w) is differentiable for each w and that 



By induction on fc, this shows that (po jm (t,w) is at least C k in t for each A;, and has time derivatives 

Qk 

dt k ' 



Wk<-Po,m{t, w) = (pk, m (t, w). Of course, by elementary complex variables, 



Qm 00 Qm 
<Pk,m(t,w) = Q^^g ( n ) (t)w n = —<p kfi (t,w); 
n=l 



thus, we have shown that 



Qm Qk Qm Qk 

for m, > 0. As we have shown that <p k ,m G C°(IR + x D), this proves that -0 is C°°(IR + x D) as 
required. □ 

Our next goal is to extend this result to analyticity of ip. Analyticity in the spacial variable w € D 
follows immediately from the proof of Proposition 2.7. Analyticity in t is much more involved. It will 
pay to first translate the ODEs of Equations 2.21-2.23 into a PDE for the Cauchy transform of /if (which 
is simply related to tp(t, •)), and then use analytic function techniques in the spacial variable; this is the 
purpose of the next two sections. 

Remark 2.8. One might hope to use the bounds of Lemma 2.5 to prove analyticity in t using Taylor's 
theorem. Indeed, the order iV Taylor polynomial of tp(-,w) centered at to is 

N 1 d k N 1 

Tg i/>(t, w)=J2 fcj Q^fo' «0(* - *o) fc = Yl ^k,o(to,w)(t - t ) k . 

fc=0 ' k=0 
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The coffieicents are ^ times 

oo 

<Pk,o(to,w) = ^g^it^w 71 - 

n=l 

Using the bound of Lemma 2.5, we have 

oo oo 

Ko(*o,™)l < J2\9 ( n ] (to)\\w\ n < (fc + 2)2 fe+1 £ 

n=l n=l 

However, the sum is bounded below by 

-(2k+2) 



n 2k+1 \w\ n . 



^n^\wr> (in ^y^' (2k + 1)! 



n=l 



(this comes from interpreting the sum as a right end-point Riemann sum for an integral which, after a 
substitution, transforms into the Gamma function). But this yields a bound for the Taylor coefficents 
that grows too fast with k: 

(k + 2)2 k+1 (in T^i) {2k+2) ( 2k ~^~ ^ ! | oo super-exponentially. 



\w\J k \ 

Hence, this elementary approach cannot be used to prove analyticity in t. In Section 2.5, we will use 
more sophisticated techniques to prove that these Taylor series coefficients are, in fact, exponentially 
bounded. 

2.4 The flow of the Cauchy transform 

The moment function ijj(t, w) of Equation 2.29 is closely related to the Cauchy transform of the measure 
fit, which concerns us in this paper. Indeed, the coefficients g n (t) are the moments of fit. Since ip(t, w) 
converges uniformly for w £ B, taking z = 1/w for «j/0, we see that i/j(t, 1/z) converges uniformly 
for \z\ > 1, and hence by the Fubini-Tonelli theorem, 

/ Ef-)" ^ dx ) = E4r / x n fi t (dx) = Y^g n (t)^=i;(t,l/z), \z\ > 1. 

J ° n>l KZJ n>l Z J ° n>l * 

On the other hand, since the support of fit is contained in [0, 1], for \z\ > 1 the series Yln>i( x / Z ) n 
converges, and we have 



U{t, 1/z) = - f 1 (-^— - l) fi t (dx) = f 1 f^l-l = G, t (z)--, \z\>l. 
z z J \1-x/z J J z-x z z 

Finally, this means that the function G of Equation 1.8 is related to ip by 

G(t, z) = Gm {z)-\ + mm{ " ,/3} = - Wt, 1/z) + min{a, /3}) . (2.32) 
From Equation 2.32 and Proposition 2.7 it follows immediately that 

Corollary 2.9. The function G(t, z) of Equation 1.8 is C°°(]R + x (C — B)), and for each t > G(t, z) is analytic 
in \z\ > 1. Moreover, on this domain, 

f G (M) = !W) = £^ (2.33) 

n>l 
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We now combine Equation 2.33 with Equations 2.21-2.23 to deduce a partial differential equation 
satisfied by G. To begin, 



% G = E ^+T9n = ^(-91 + <*P) + ^(-2g 2 + 2(a + /3) 5l - 2«£ 



9 



n>l 

. / n— 1 n—1 

+ J] ~ n 5" + n ( a + Z 3 )^-! - n 9j9n-j +n^2 9j—l9n-j I • 

n>3 * V j=l j=2 



By the uniform convergence on the domain \z\ > 1, the order of all summations may be interchanged. 

9t' 



It is convenient to recombine the expression into two parts: -j^G = S\ + S2 where 



a/3 ^ ng n ^ 77,# n _i 

n>l n>2 

and 

2 2 n— 1 n—1 

5 2 = " - E ^TT E ft'^-i + E ^TT E 9j-\9n-y 

n>3 j=l n>3 j=2 

To deal with the first sum Si, note (from Equation 2.32) that 

r><+ \ ^ / ( 4- ~\ 1 \ 1 • r <m min{a,/3} ^ g n (t) 
G{t, z) = ~(<p{t, 1/z) + mm{a, /3j) = + ^ 

Z 2 n>l Z 



For convenience, let us define 



CM = Ei = *.)- = G„ W - I. (2.34) 



z' 

n>l 



Then 4d = 4G = Si + S 2 . Now, 



y =dy?iL = V( zGx) (2.35) 

n>l n>l 



which holds true for \z\ > 1. Similarly, 

ng n -i sr^ ( n + 1 / f™ , n 



~n+l ~~ z n+2 ~ ~ I 2-/ *n+l ^— Z n4_1 ^ 

n>2 n>\ \n>l n>l 



and employing Equations 2.34 and 2.35 this becomes 



n>2 

Combining Equations 2.35 and 2.36 and simplifying, this shows that 



^ + G 1 + (z-a-fi%-G 1 . 

oz 
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(2.37) 



(2.38) 



The terms in 52 can similarly be expressed in terms of G\. in fact, in terms of G\ . For \z\ > 1, 

\n>l ) n>2 j=l 

Whence, for \z\ > 1, 

d d 1 n ~ 1 n n ~ 1 

ai (zGl)2 = E E *' 5n ^ = " E ^TT E *Sn-j 

n>2 j"=l n>2 j'=l 

n-1 

Z 2 \ — ^ \ — ^ 

= ~ 7^+1 Z^9j9n-j- 

n>3 j=l 

Regarding the second term in 52, note that for each n > 3 

n-1 

^2gj-ig n -j= 9 'J gk - 

j=2 j+k=n-l 
j,k>l 

Whence, we can manipulate the sum as 

(zd) 2 = Y.^Y. 3j9k = E 3j9k, 

n>2 i,h>l n>3 j+k=n-l 

j + k=n j,k>l 

and so 

|(^) 2 )=|E^T E ^ = -E^£Vi^i (2-39) 

n>3 j+*=n-2 n>3 1=1 

j,k>l 

holds true for \z\ > 1. Combining Equations 2.38 and 2.39, we have 

5 2 = ^(,G 1 ) 2 -^(.(G 1 ) 2 ). (2.40) 
We have thus proved the following result. 

Proposition 2.10. Let G\(t, z) be defined (for t > and \z\ > I) as in Equation 2.34. Then G\ satisfies the 
partial differential equation 

8 ^ d ( , n2 , / o\n a P 



d fi = q- z [z{z - l)(Gi) 2 + (z-a- P)G X - -f ) . 

Proof. As noted following Equation 2.34, J^Gi = Si + 52- Simplifying Equation 2.37 reversing the 
product rule, 

5i = + G\ + (z — a — P)q^Gx = — ( + (z-a- / 9)G ] 



On the other hand, Equation 2.40 immediately yields 

S 2 = §- z (zG 1 ) 2 -§- z (z(G 1 f) = §- z ((z 2 -z)Gi). 
Combining these equations, which are valid for \z\ > 1, yields Equation 2.41. □ 
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Remark 2.11. Since = JjG^, we can rewrite Equation 2.41 in terms of the Cauchy transform 

directly; it is easy to check that the result is 



While we have shown the equation holds only in the regime \z\ > 1, we will show below that it actually 
holds true on all of C+. Hence, the poor behaviour near z = becomes a technical issue. It is partly 
for this reason that the shifted transform G(t, z) = G fH (z) — l ~ mm ^ a <^ i s useful: it encapsulates the 
singularity in a static form, leaving a smooth flow in the vicinity of 0, as demonstrated by the form of 
Equation 2.43 below. 

Corollary 2.12. Let G be the shifted Cauchy transform of Equation 1.8. Then for t > and \z\ > 1, 

^G = [z(z - l)G 2 - (az + b)G] (2.43) 
where a = 2 min{a, 8} — 1 and b = \a — (3\. 

Proof. From Equation 2.34, G\ = G — mm ^'^ . Consider the case a < 8, so G\ = G — j. We simply 
change variables in Equation 2.41, which holds for \z\ > 1. As JjGi = J^G, we need only transform 
the quantity differentiated with respect to z in Equation 2.41. 

z(z-1)(G!) 2 + (z-q-/3)Gi-^ 

^_ 1)(G _2)v (a _ a _, )( ;^)_f 

=z(z - I) { G G + -?)+(z-a - B)G -a + — — - 

\ z z z J z z 

, ,,^o / n n , ^ en 2 , iN a(a + 8) ad 
=z(z - l)G 2 + {z-a- 8- 2a{z — l))G-\ (z - 1) + — — - a 

z z z 

=z{z - l)G 2 + ((1 - 2a)z + a - 8) G + a 2 - a. 

Differentiating yields Equation 2.43 in the case a < 8; the reader may readily verify the formula also 
holds true in the case a > 8. □ 

Remark 2.13. Without prior knowledge of the structural singularity in the measure fit a s t — > oo (cf. 
Equation 1.6), one might simply try to change variables by removing a pole of unknown mass at 0: 
G = G\ + — . Easy calculations show that the only masses m that transform Equation 2.41 to a form 
without an explicit singularity at are m = a and m = 8; hence, the choice here is natural. 



2.5 Analyticity of the Cauchy Transform in t 

Our goal in this section is to extend Corollary 2.9 to show that G(t, z) is not only C°° in t but, in fact, 
analytic in t for \z\ > 1. To do so, we will actually use the PDE 2.43 proved to hold in Corollary 2.12, 
together with our a priori knowledge of analyticity in z. 

Remark 2.14. PDE 2.43 is a semilinear complex PDE similar in form to the complex inviscid Burger's 
equation. In general, it exhibits all the hallmark pathological behaviour of nonlinear PDEs: blow-up 
in finite time, even with uniformly bounded initial data, and shock formation. Since the equation is 
non-linear, Holmgren's uniqueness theorem does not apply. Thus, although the Cauchy-Kowalewsi 
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theorem proves the existence of an analytic solution with given analytic initial condition, it is only 
unique amongst potential analytic solutions, and there may well be non-analytic solutions with the 
same initial data G(0, z), so the C°° result of Corollary 2.9 does not immediately prove analyticity. In 
this section, we use the form of the PDE, together with the a priori knowledge of analyticity in z, to 
show that our particular solution G(t, z) is, indeed, analytic in t as well. 

We begin with the following recursion for the t-derivatives of G. 
Lemma 2.15. For t > and \z\ > 1, and for k > 0, define 

1 d k 

U dt k K ' 

Let j> = p(z) = z(z — 1) and q = q(z) = —(az + b), cf. Equation 2.43, so that the C°° 
G satisfies the PDE dtG = d z \pG 2 + qG] on its domain. Then 



x (C-D)) function 



+ l)G k+1 = |- 
az 



py^GjGk-j + qG k 



k>0. 



(2.44) 



Proof. The case k = is the statement of PDE 2.43. Proceeding by induction, since G is C°° we may 
commute t and z derivatives. We have 



1 d k+l _ d 

{k + 2)\dt k+2 ~ ~ dt{k + \)\dt k+lG= di Gk+1 ' 



(2.45) 



and so by the inductive hypothesis 

fe 



(k + 2)G k+2 



1 d d 
k + ldidz~ 



p^GjG^j +qG k 
j=0 



d_ 

dz 



k + 



i=o 



Shifting the index down one in Equation 2.45 shows that fctrjJjGfc = G k+ \, as desired. For the 
quadratic term, we use the product rule. Again utilizing Equation 2.45, 

r) f)C BG 

— (GjGk-j) = -Q^-Gk-j + Gj = (j + l)G j+1 G k -j + (k-j + l)G 3 G k ^ +l . 

Thus 







dt 

j=0 j=0 j=0 

k+l k 

= ^2iGiG k - i+1 +^2(k-j + l)GjG k -j+i 
i=l i=o 

where we have made the substitution i = j + 1 in the first sum. Separating off the i = k + l term in the 
first sum and the j = term in the second sum, and relabeling j — > i in the second sum, this yields 







]T —{GjGk-i) = {k + l)G k+1 G + £ iGiG k - i+1 + - i + l)GiG k ^ +l + (k + l)G G fc+1 



i=o 



8=1 



i=l 
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which simplifies to 

fe+i 



^(G,G H ) = (H1) 

3=0 



Gk+iGo + ^2 GiGk+i-i + GoGk+i 



i=i 



i=0 



Combining with the equation following 2.45 and the following comment yields the result. □ 

We will use the recursion Equation 2.44, in conjunction with analyticity of G, to prove much tighter 
bounds on the derivatives Gk(t, z) than those discussed in Remark 2.8, allowing us to prove conver- 
gence of the Taylor series of G(t, z) centered at any t > 0. First we show that the i-derivatives G k are 
also analytic. 

Lemma 2.16. For each k > 0, and each t>0, the function Gk(t, -)from Lemma 2.15 is analytic on C — D. 

Proof. First, Go(t, •) = G(t, •) is analytic on C — [0, 1] (cf. Equation 1.8 and Definition 1.10), verifying the 
claim in this base case. We proceed by induction. Let K be any compact subset of C — D, let to > 
and let < e < t . By Corollary 2.9, G k (t,z) is C°° for t > and \z\ > 1, and thus \G k+1 (t,z)\ = 
(fc+i)! I Qt.k+i G(t, z)\ is uniformly bounded on [to — e,to + e] x K. Now, for \h\ < e, the fundamental 
theorem of calculus asserts that 

G k {t + h,z)-G k {t ,z) f 1 8 



f 1 d f 1 
J -^G k (t + sh, z) ds = (k + 1) J G k+ i(t + sh,z)ds 



h 

with the second equality following from Equation 2.45. Thus we have 

G k (to + h,z) - G k (t ,z 



h 



<(k + l) sup \G k+1 (t,z)\. (2.46) 

|t-i |<e 



Let < e be any sequence tending to 0, and let h n (z) = ^-[G k (to + h n ,z) — G k (to,z)]. By the infuctive 
hypothesis, h n is analytic on a neighborhood of K, and Inequality 2.46 shows that the family {h n } is 
uniformly bounded on K. By Montel's theorem, there is a subsequence that converges normally to an 
analytic function on K. But the seuqence h n converges to dtG k (to, •) = (k + l)G k+ i(to, •)■ This proves 
that G k+ \ is analytic on K, and hance on the domain C — D as claimed. □ 

Before proceeding to the main estimates, we state and prove a lemma which is a version of the 
Cauchy estimates from complex analysis. 



Lemma 2.17. Let zq g C and rj > 0. If h is analytic on a neighborhood of the closed disk B(zo,r?), and 
< rf < 7], then 

\h'{z)\<—^— max \h(Q\, for all z G B(zo,rf). (2.47) 

V~V C6BOo,r?) 



Proof. From the Cauchy integral formula, if r > is such that h is analytic on a neighborhood of D(z, r), 
then 

2?" J d n(z,r) (C - zY 



Thus 



\ti(z)\ < — I il^.l dC = 1 \H()\d( < ■ 2irr ■ max \h(Q\. (2.48) 
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If z £ O(zo) ?/)/ then the closed disk D(z, 77 — 7/) is contained in D(^o, 77) where /i is holomorphic; so, 
taking r = 77 — 7/ in Inequality 2.48 yields 

< 7 max \h(0\<^—7 max |/i(C)| 

as desired. □ 
Let us introduce the following maximal functions. For k > 0, t > 0, \z \ > 1, and < rj < \zq\ — 1, 
M k (t ,z ;7]) = max \G k (t ,Q\, M' k (t , z ;r]) = max iGjfai.OI- (2-49) 

CeB(z ,»7) C6B( 20 ,»7) 

When the point (to , ^o) is understood from context, we will shorten the notation to M k (77) = (to , zq ; 77) 
and M' k {rj) = M' k (to, zo; r/). By the analyticity result of Lemma 2.16, the Cauchy estimates of Lemma 
2.17 yield the following maximal Cauchy estimate: 

MLW) < ^4- (2.50) 
77 — 77' 

We now use Inequality 2.50, together with the recursion of Equation 2.44, to prove exponential-growth 
bounds for G k inductively. The key idea is to apply the maximal Cauchy estimate repeatedly with 
an appropriately choen 77" £ (77', 77). It is important that 77" be chosen to minimize the bound for each 
individual term, or else the resulting estimates blow up super-exponentially. 

Proposition 2.18. Let to > and \zq\ > 1, and let < 77 < min{|zo| — 1, 1}/ so that the disc H>(zq, 77) of radius 
77 centered at zq is contained in C — ID). There is a constant c = c(zq, rj) so that, for all < 77' < 77 and all k > 0, 

M fc 7/' < — } V ^ 2 • (2.51) 

(77 — 77' ) K 

Remark 2.19. It will be important that the constant c does not depend on to- Indeed, we will see that c 
does not depend on the value of Go at all; it can be taken to equal 52 times the maximum modulus of 
the polynomials p, q,p', q' on B(zq, rj), withp(z) = z(z — 1) and q{z) = —{az + b). 



Proof. The function Go (to, ■) is analytic on a neighborhood of 0(zq, 77), so it is uniformly boudned by 
Mo (77) on the disk, which proves the inequality in the base case k = 0. We proceed by induction. Fix 
k > and suppose that we have shown that, for < i < k, there are constants c# = ^(77, zq) so that 

M t (rf)< f 0<£<k. (2.52) 

Proceeding to k + 1, we use the recursion of Equation 2.44, which we now expand: 





(k + l)G k+1 = — 



k 



3=0 

k k 

p' GjG^j + 2pJ2 G'jG^j + q'G k + qG' k 

j=0 j=Q 



(2.53) 
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We will bound each term in this recursion separately. Recall that p(z) = z(z — 1) and q(z) = —{az + b) 
are polynomials. Hence there is a constant A => 1 so that max{|p(z)|, |i/(z)|, |<z(z)|, k'^))} < A for all 
z G 0(2:0, rf). Thus, on H(zq, rj'), 



k-j 



3=0 



< A^IG.IIG^I < X^M^M^r,') < 



3=0 



3=0 



3=0 



(rj — rj') 3 (rj — r/') k 3 



Hence, the first term is bounded by 



3=0 



•-3 



(A _ A , )fc ^ J - (r? _ ^ 

where the final inequality is justified by the assumption that rj < 1 so that rj — rf < 1. 
For the second term in 2.53, we can make the initial estimate 



(2.54) 



3=0 



< 2A ]>>j( ? /)M fc _, ( r/) < 2Ag ( ^- J )fc _. Afj(T/) 



j=0 j=0 

Now, for any 77" G {rf , rf), Inequality 2.50 with 77" in the role of rj yields 

77 — 77 

Now applying the inductive hypothesis Inequality 2.52, this time with 77" in the role of rf, yields 



(2.55) 



Mj(v") < 



(rj — r)") 3 



(2.56) 



Thus, we have the estimate 



3=0 



< 



c j c k-j 



3=0 



(rj" — rf)(rj — rf')- '(77 — r]') k 3 



(2.57) 



which holds for any rj" G (?/, rf). We now optimze the inequality over rf' separately in each term in the 
sum. By elementary calculus, we find that the minimum occurs at rf' = -=^rf + j^j 7 ?- At this point, 



ri-rj" = ■ffiiv-r/), rj" - rj' = ^(rj-rj'). 



Thus, 



inf 



if<r)"<ri (rj" — Tj')(rj — rf') 3 



if j {3 + 1) 



i+i' 
1 



e ■ + 1) 
(rj - rj'y +1 ~ (rj - r?') i+1 ' 



< 



(2.58) 



Inserting these estimates into the terms in Inequality 2.57, we have 



3=0 



< 



2A£ 

3=0 



e ■ (j + 1) • CjC k -j 
(rj — rj') 3+1 (rj — rf) k ~ 3 (rj — 77') 



-2eA(fc + l) c 3 c k~j- 
3=0 



(2.59) 
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The third term in 2.53 is straightforward to estimate: 



Ac fc 



(2.60) 



again using the assumption 7/ — 7/ < 1. 

For the fourth term, we use the same approach as the second term. First we have \qG' k \ < XM' k (X'). 
Let A" = £T7jV + ]frj'?- Then Inequalities 2.55,2.56, and 2.58 give 



Finally, combining Inequalities 2.54, 2.59, 2.60, and 2.61 with Inequality 2.53 shows that 

1 



(2.61) 



(fc + l)|G fc +i| < 



and thus 



(77 — rj') k+1 



M k+1 ( V ') < 



k k 
A CjCk-j + 2eA(fc + 1) ^ CjCk-j + \c k + e\(k + l)c k 

3=0 3=0 



1 



(v - V') 



(1 + 2e)A 



j=0 



This completes the induction to show that Inequality 2.52 holds additionally for I = k + 1, provided 
that 

fc 



Cfe+i > (1 + 2e)A 
To this end, we now recursively define 



j=0 



(2.62) 



Cfc + i = 2(1 + 2e)A^^ c j c k-j for /c > 0, Co = max{Mo(7j), i}. 

j=0 



Note that 2cq > 1 and (by induction) cj > for all j, and so 



c k+1 = 2(1 + 2e)A 



^! C j C k-j + CjCfc_ 



3=0 



3=0 



> (1 + 2e)A 



> (1 + 2e)A 



3=0 
k 



3=0 



(2.63) 



as desired. Thus, to conclude the proof, it only remains to show that the constants defined by Equation 
2.63 are bounded by the given exponential form. In fact, it is straightforward to verify that the solution 
to the recursion Equation 2.63 is given by the scaled Catalan numbers 



Ck 



(2(1 + 2 e )A) fc (max{M (77), \}) k+1 C k , 



where C k = ^tpr { 2 k ) < 4 fc . Therefore, taking c = 8(1 + 2e)A proves the result. 



□ 
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This brings us to the main result of this section: that G(t, z) is analytic in t for \z\ > 1. 

Proposition 2.20. The function G(t, z) of Equation 1.8 is analytic in both variables (t, z)for t > and \z\ > 1. 

Proof. Since G(t, •) is analytic on C — [0, 1] (cf. Equation 1.8 and Definition 1.10), we need only concern 
ourselves with (real) analyticity in t. Let \z$\ > 1 and t > 0. Corollary 2.9 shows that t i->- G(t,zo) 
is C°°, and so it suffices to show that the Taylor series T to G(-,zo) of this function is convergent in a 
neighborhood oft = to- By definition (cf. Lemma 2.15), 



i ak 00 

T t0 G(t, zq) = £ T^Gfo, *))(* - t ) k = G k(to, zo)(t - t ) k . (2.64) 



^ kl dt k 

k=0 k=0 



Let < rj < min{|zo| — 1, 1}/ and < rf < rj; then Inequality 2.51 gives exponential bounds 

\G k (t ,z )\ < M k {rf) < 



c fe (max{Mo(7?), \}) k+1 



{rj — rj') k 

The root test then imples that the Taylor series T to G(t, zq) converges for \t—to\ < (rj— ?]')/cmax{Mo(jj), |}. 
This proves the proposition. □ 



2.6 Analytic Continuation to C + and the Proof of Theorem 1.2 

We have now proven the statement of Theorem 1.2 restricted to C+ — B: G(t, z) is analytic in both t > 
and z £ C + subject to the constraint \z\ > 1, and it satisfies the PDE 1.9 on this domain (cf. Corollary 
2.12). We knew a priori that G(t, z) is analytic in z for all z £ C + (cf. Equation 1.8 and Definition 1.10); 
it thus remains to extend the analyticity in t into the larger z-domain. This is actually quite simple, 
once we reinterpret t as a complex variable. 

Lemma 2.21. For each zq € C — B, there is e = e(zo) > so that 1 1->- G(t, zq) has an analytic continuation to 
the strip {t e C: > 0, |9?t| < e(zo)}- T/ze tolerance e(zo) is independent oft 

Proof. This follows immediately from the proof of Proposition 2.20: for t > 0, zq) is given by 
the power series T to G(t, zq) of Equation 2.64 centered at to, with radius of convergence at least (rj — 
r ?')/ c ( r ?) max {^oijf)-, \} for any < rj' < rj < min{|zo| _ 1>1}- We can therefore take rf I and 
rj t rjQ = min{|zo| — 1, 1}, and therefore define e(zo) = T]o/c(r]o) max{Mo(r?o) 5 ^} > 0, independent of to) 
these power series, for all base points to > 0, define the analytic continuation. □ 

Hence G(t, z) can be viewed as a complex analytic function of two variables. This brings to bear all 
the tools of many complex variables. We can now complete the proof of Theorem 1.2, with the help of 
a lemma of Hartog. 

Proof of Theorem 1.2. Fix a point (to, zq) £ M + x C — B. Let D be the largest disk centered at zq that does 
not intersect [0, 1], and let D' be the largest disk centered at zq that does not intersect B (so D' C D). 
Then G(t,z) is complex analytic in t > for z £ D' by Lemma 2.21. Since G(t,z) is the Cauchy 
transform of a positive measure u t = m — (1 — min{a, /3})5o of total mass < 1, supported in [0, 1] 
(cf. Equation 1.8 and Definition 1.10), it is complex analytic in z for all z £ D, and is also uniformly 
bounded on compact subsets of C + (cf. Inequality 1.13). It therefore follows from a lemma of Hartog 
[18, Lemma 2.2.11] that G(t, z) is jointly analytic in (t, z) for t > and z in the larger disk D. Applying 
this at each point of z$ £ C + — B shows that G(t,z) is analytic on M + x (C+ — [0, 1]), as desired. 
Thus, the functions on both sides of PDE 1.9 are analytic on this domain, and by Corollary 2.12 the are 
equal on the open set M + x (C + — B), it follows that they are equal on their larger analytic domain 
R + x (C+ - [0, 1]), concluding the proof. □ 
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3 Local Properties of the Flow fi t 

In this section, we develop properties of the measure ^ t directly from the PDE of Theorem 1.2 that 
determines its Cauchy transform. Let us define a new positive finite measure v t , supported in [0, 1], by 

fH = Vt + 0-- min{a, f3})5 (3.1) 

so that G(t, z) = G Vt (z) is the Cauchy transform of v t , cf. Equation 1.8. Since nt is a probability measure, 
the total mass of u t is 

u([0, 1]) = min{a!, 0} > 0. (3.2) 



3.1 Steady-State Solution 

■ ■■■■■■ 



To begin, as a sanity check, note that the steady-state equation (determined by j%G(t, z) = 0) takes the 



form 

d z [z(z- l)G 2 - (az + b)G] = 0. 

Due to the analyticity of G on the connected domain C+, this forces z(z — l)G 2 — (az + b)G to be 
constant. The constant can be determined from the known limit behaviour of the Cauchy transform: 



Thus 



lim zG(z) = z^([0, 1]) = min{a, /?}. 

\z\— >oo 



lim [z(z — l)G(z) 2 — {az + b)G(z)] = min{a, /3} 2 — aminja, /3}. 

\z\—>oo 



Using a = 2 min{a, /?} — 1, it follows that the steady state solution satisfies 

z(z — l)G oa (z) 2 — (az + b)Goo(z) = min{a, /3}(1 — min{o;, /?}) 
This quadratic equation has (simplified) solutions 

(az + b)± ^z 2 - 2(a + /3 - 2a/3)z + (a - (5f 



Goo(z) 



2z(z - 1) 



The discriminant matches that in Equation 1.5, but the terms outside the radical do not match; of 
course, that equation describes the Cauchy transform of the full limit measure (jl, while our measures 
vt have the point mass at z = removed. Adding it back in, 



l-min{a,/3} z + a + f3 - 2 ± yj ' z 2 - 2(a + f3 - 2a/3)z + (a - j3f 
h ^00(2:) — 



2z(z - 1) 

which precisely matches the Jacobi measure's Cauchy transform from Equation 1.5, as expected. In 
other words, using the Stieltjes continuity Theorem 1.11, we have: 

Proposition 3.1. The spectral measure [it ofqptq converges weakly to the free Jacobi measure \i of 'Equation 1.6 
as t — > 00. 

This was already known, as a consequence of the fact that pt, q are asymptotically free as t — ^ 00; it is a 
good sign that this can be seen directly from PDE 1.9. 



31 



3.2 Conservation of Mass and Propagation of Singularities 

The support of the measure fi in the limit as t — > oo is the full interval [0,1]; however, the initial 
condition fio is only constrained to have point masses of appropriate magnitudes at the endpoints 
{0, 1}, cf. Equation 1.6, and thus supp/xo may be any closed subset of [0, 1]. It is therefore possible 
that, for some t > 0, supp fi t is disconnected. The following results deal with the flow of such support 
"bumps" under PDE 1.9. 

Lemma 3.2. Let to > 0. Let U\, JJi be two disjoint open subintervals of [0, 1] (with the relative topology), and 
let K i c Ui and K2 C U2 be closed subsets. Suppose that supp fit Q Ki U K 2 . Then, for some e > 0, 
supp fi t C U\ U U 2 for all t e (to — e, t + e). 

Proof. Applying the analytic continuation argument in the proof of Theorem 1.2 (on page 30) to the 
larger domain C — U\ U Ui shows that G(t, z) is analytic in both variables for z in this domain, and 
PDE 1.9 also holds there in a neighborhood of time to- The result follows immediately. □ 

In particular, this shows that, if for some to > the support of fit a consists of a finite (or countable) 
collection of disjoint closed intervals in [0, 1], the same holds true for all t > to- Moreover, we can 
quantify the motion of the endpoints of these intervals. For the following, we assume that G Ut has a 
continuous extension to (0, 1) (i.e. fit has a continuous density); we will prove this assumption holds 
true for all t > in Section 3.3, at least in the special case a = f3 = \. 

Lemma 3.3. Suppose G(t, z) has a continuous extension to z € (0, 1) for all t > 0. Let xt be a point in the 
boundary of the support of fit- Then t \-t xt satisfies the ODE 

it = 2G(t, xt)x t {l - x t ) + ax t + b, t > 0. (3.3) 

Proof. The Cauchy transform of a compactly-supported meaure is one-to-one on a neighborhood of 00 
in C + , so at least for small w, there is an analytic function K(t, w) so that w = G(t, K(t, w)). To simplify 
notation, denote G(t,z) = G t {z) and K(t,w) = K t (w); let G' t (z) = j%G(t,z) and K' t {w) = J^K(t,w). 
Noting that w = Gt{K t (w)) for small w and differentiating with respect to t, we have 

= ^G t (K t (w)) = ^(KtH) + G't(K t (w))^( W ). (3.4) 



Now, PDE 1.9 can be written in the form 

(z) = (2z - l)Gt(z) 2 + 2z(z - l)Gt{z)G' t {z) - aG t (z) - (az + b)G' t (z), 



dG t 



ot 

and so, using Gt(K t (w)) = w, Equation 3.4 yields 

- G >(K tiw) ) d -§ iw) (35) 
= (2K t (w) - l)w 2 + 2Kt{w)(K t (w) - l)wG' t (K t (w)) -aw- (aK t (w) + b)G' k {K t {w)). 

The inverse function theorem shows that G' t (K t (w)) = l/K[(w). Combining this with Equation 3.5 
and simplifying, we have 



dt 



(w) = (2K t (w) - l)K' t (w)w 2 + 2K t (w)(K t (w) - l)w - aK' t {w)w - {aK t {w) + b) (3.6) 
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which can be written in the form 

-gf( w )=g^ [w 2 K t (w){l - K t {w)) + awK t (w)] + b. (3.7) 

The existence of a continuous extension of Gt to a neighborhood of (0, 1), coupled with standard ana- 
lytic continuation arguments, then implies that the PDE 3.6 holds up to the boundary Gt((0, 1)). 

Now, let x t be a boundary point of a support interval of v t ; then Gt is singular at xt- Let ?/£ = Gt(xt) 
be the corresponding singular value; then K' t (y t ) = 0. Hence, with w = y t , equation 3.6 asserts that 



dK t 
dt 

On the other hand, 



(y t ) = 2K t (yt)(K t (y t ) - l)y t - {aK t {y t ) + b) = 2x t {x t - l)y t - (ax t + b). 



dx t 9 dK t dy t dK t 

~dT = di Kt{yt) = -df {yt) + KM H = -df {yt 



since yt is (by assumption) a critical point for K t . Hence, we have the ODE 

x t = 2y t x t (l - x t ) + (ax t + b) 

which yields the result, since yt = Gt(xt). □ 

Equation 3.3 gives a precise (implicit) formula for the speed of propagation of singularities on the 
boundary: the edges of the support move with finite speed x t so long as they stay away from the 
endpoints {0, 1}, since G(t, x)x(l — x) is continuous on (0, 1). In the sequel, we will see that, although 
G(t, x) may blow up at x = 0, 1, the function G(t, x)y / x(l — x) remains bounded; thus, Equation 3.3 
yields further information: as any support "bump" approaches the endpoints, its speed decreases to 0. 

Remark 3.4. Equation 3.3 at first seems to suggest the boundary point x t , which is of course in [0, 1], 
evolves into the lower half-plane: indeed, the function G(t, z) takes values in the closed lower half- 
plane even for z £ [0,1]. However, xt is assumed to be at the boundary of the support of fit which, 
under the assumption of the lemma, possesses a continuous density. The Stieltjes inversion formula 
of Equation 1.14 then shows that QG(t, xt) = 0, since the density of /it is at a boundary point of the 
support; hence, G(t, x t ) is, in fact, real. 

Remark 3.5. If the support of fit is not the full interval, or more generally if Gt possesses a singular point 
in [0, 1], this singular point cannot dissipate in finite time; this follows from Hartog's second theorem 
and the continuity principle, cf. [18]. Lemmas 3.2 and 3.3 are in line with this observation: singular 
points propagate with finite speed and slow down as they approach the endpoints, therefore never 
dissipating. 

One final result that follows from this framework is conservation of mass of support "bumps". 

Lemma 3.6. Let to, U±, U2 be as in Lemma 3.2. The total mass of fitl^ is preserved for t G (to — e, to + e )- 

Proof. Since, by Lemma 3.2, supp u t C supp fit is contained in U\ U U2 for t G (to — e, to + e), the Cauchy 
transform G(t, z) = G Ut (z) is analytic for z G C — U\ U Ui- Let ol\ be a closed C 1 curve in C — U\ U U2 
which encloses U\ and has winding number around each point in U2. Then 

MU~i) = ^~i G(t,z)dz. (3.8) 

Equation 3.8 holds by the standard Residue Theorem if fit is a discrete measure - a convex combination 
of point-masses - suppored in U\', any measure may be weakly approximated by discrete measures, 
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and by the Stieltjes continuity Theorem 1.11 weak convergence of measures implies uniform conver- 
gence of the Cauchy transforms on compact subsets of C — U\ U U2, justifying Equation 3.8. 

Lemma 3.2 justifies that PDE 1.9 holds in C — U\ U C/ 2 , and the solution is analytic in z and t. In 
particular, it follows that the integral on the right-hand-side of Equation 3.8 can be differentiated with 
respect to t under the integral, so ^t{U\) is differentiable, and PDE 1.9 then gives 

f ^ - h £ m G(t - z > rfz = h £ I - w - ^ + <" z + 6 » G(4 - z « d * (3 - 9) 

for t G (to — e, *o + e)- The analyticity of G(i, z) in a neighborhood of the image of a\ now guarantees 
the integral on the right-hand-side of Equation 3.9 is 0, by the fundamental theorem of calculus. □ 

Remark 3.7. Of course, if support "bumps" merge in finite time, the total mass combines additively. In 
fact, Theorem 1.6 shows that, at least in the case a = ft = \, such collisions do not occur. 



3.3 Subordination for the Liberation Process 

This section is devoted to the proof of Theorem 1.5. We assume, from this point forward, that a = /3 = 
^ . It is plausible that similar techniques may apply more generally, but that discussion is left to a future 
publication. The proof is fairly involved: we essentially develop an analogue in the present context of 
Biane's theory of subordination for the additive free convolution. To clarify the proof, we begin with 
an outline. The case a = /? = | corresponds to a = b = in Equation 1.9. So v t is a positive measure of 
mass I supported in [0, 1], and its Cauchy transform G(t, z) = G Ut (z) satisfies the PDE 

-G(t,z) = — [z(z-l)G(t,z) 2 ], t > 0, z G C+. (3.10) 

We begin by making a change of variables. The function z \— > sfzyj ' z — 1 (where we use the standard 
branch of the square root function) is analytic on C — [0, 1]. Let 

H{t,z) = H t (z) = ^Vz^lG{t,z), t>0, zGC + . (3.11) 

Then H(t, z) is analytic in both variables for z G C+ and t > 0, and satisfies the PDE 

-H(t, z) = ^Vz~^l— [H(t, z) 2 ] (3.12) 
on its analytic domain. We introduce the auxiliary function M, and the domain S: 

M(w) = \e~ 2w {e 2w + \) 2 , S = {w = u + iv eC: u> \hx\,Q < v < f }. (3.13) 

The function M is entire; when restricted to the strip S, M\ s is injective, and its image M(S) = C+. 
Let L = (Mjg)" 1 ; explicitly 

L{z) = \hxM(z) = \hi{z-\ + Jz^z=T), z G C. (3.14) 

In fact L has an analytic continuation to C — (—00, 1] (given by Equation 3.14, and this extension is 
injective: it is the inverse of M on the larger domain S U S (closure union conjugate). On this domain, 
L'(z) = 2 ^fz\jz-\ ' M° re generally, L extends to a multivalued holomorphic function (on the infinite- 
sheet Riemann surface covering the slit plane C — [0,1]). 
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Given the solution H t to PDE 3.12, define the subordinator f t as 

f t (z) = M[L{z) + tH t (z)], z G C+. (3.15) 

This subordinator is a deformation of the identity: fo(z) = M(L(z)) = z. (One should be wary, 
however: when w G C + , L(M(w)) is only equal to it) for w G 5.) Using the method of characteristics, 
we will show that iJ(i, z) satisfies the fixed-point equation 

H t (z) = H (f t (z)). (3.16) 

Equation 3.16 transfers the dynamics of PDE 3.12 to a deformation f t of the identity, changing the role 
of Ht from active to passive. Immediately from this equation, we see that smoothness is propagated: 
if Hq\ c happens to have an analytic continuation to a complex neighborhood of [0,1], then the same 
must be true for H t for all t > 0. 

In fact, we will prove (Theorem 3.19) that, for any initial measure u and any t > 0, f t extends to a 
homeomorphism from the closed upper half-plane C + to a region in C + bounded by a Jordan curve; 
it then follows from Caratheodory's Theorem (cf. [22, Thm 2.6]) that H t has a continuous extension 
to C+. Since the measure u t is given by the boundary values of Gt, it follows that u t has a density pt 
which is continuous (with potential blow up at {0, 1} due to the factor y/x(l — x) relating Ht(x) and 
Gt(x) for x G [0, 1]). Finally, we show that the extension of H t actually has an analytic continuation to 
a complex neighborhood of any point in the interior of supp vt, concluding the proof of Theorem 1.5. 

To begin, we record the relevant properties of the auxiliary function L; the following lemma is a 
straightforward excercise. 

Lemma 3.8. Let L : C+ — > C be defined as in Equation 3.14. Then L is holomorphic and injective, its range is 
L(C + ) = S, and L satisfies 

f)T 1 

7T = r 1 r z e C+. (3.17) 

az 2-y/z-s/z - 1 

The exension of L to a multivalued function satisfies 

L[l,oo) = [iln|,oo), L(-oo,0] = [|ln|,oo) + if , 1(0, 1) = | In | + i(0, f ). (3.18) 
In particular, L has an extension to a holomorphic map on a complex neighborhood o/(0, 1). 

The first task is to prove the fixed-point Equation 3.16. 

Lemma 3.9. Let H t be the solution in the upper half-plane to PDE 3.12 with initial condition Hq analytic on 
C+. Define f t in terms of H t by Equation 3.15. Then H t (z) = Ho(f t (z)) for z G C+. 

Proof. PDE 3.12 is well set-up for applying the method of characteristics, cf. [13]. Writing it in the form 

dH dH 
— (t,z) = 2^Vz~^lH(t,z) — (t,z) 

we see that it is a homogeneous semilinear equation of the form 

b(t,z,H(t,z)) -VH(t,z) = (3.19) 
where b(t, z, w) = [l, -2^/zy/z~^Tw] . Fix (t , z ) G M+ x C+, and define z(t) = (t, z(t)) by the ODE 

^ = h[z(t),H(t,z(t))] (3.20) 
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passing through the point z (to) = (to,£o)- Then taking w(t) = H(z(t)) = H(t,z(t)) and applying 
the chain rule, Equation 3.19 shows that the characteristic curve (z(t),w(t)) is contained in the graph 
{(t,z,w) G M + x C + x C: w = H(t,z)}; moreover, the set of all such curves (for all choices (<o,2o)) 
traces out the entire graph. (This follows from the a priori knowledge that H(t, z) is analytic in both 
variables.) It is customary to write Equation 3.20 as a system for the separate variables z, w. Because 
of the homogeneity in Equation 3.19, the characteristic equations in our case are 

— = -2y/zyfz=Iw, (3.21) 
dt 

*=a ,3.22) 

Thus, w(t) = H(t, z(t)) is constant; so, in particular, we have 

F(0, 0(0)) = w(0) = w(to) = H(t , z(to)) = H(t , z ). (3.23) 

We can determine the position z(0) of the characteristic at time (given its position z(t ) = z ) from 
Equation 3.21. Since w = w(to) is constant, this can be explicitly solved in the upper half-plane, where 
the function r - 1 , has antiderivative L (cf. Equation 3.17). The solution is 

Z-y Z\ z — 1 



dz 



2yfz^~z-i 



-w(t )dt =>• L(z(t)) = -w(t )t + C 



for a constant C, which is determined by the constraint z(to) = zq; that is, C = L{zq) + w(to)to. Thus, 
at time the curve's value z(0) is determined by 

L(z{0)) = -w(to) ■ + L(z ) + w(t )t = L(z ) + t H(t , z ) (3.24) 

from Equation 3.23. It follows that L(zq) + toH(to,zo) € S, and so z(0) = M[L(zq) + toH(to, zq)] = 
/t (^o) by definition Equation 3.15. Thus, Equation 3.23 asserts that 

H(0,f to (z )) = H(t ,z ). 

Since (to, zq) G M+ x C + were arbitrary, this concludes the proof. □ 

Remark 3.10. The function f t = Mo(L + tH t ) need not, a priori, satisfy L o f t = L + tH t . Neverthe- 
less, Equation 3.24 actually clarifies that this further restriction does hold; in other words, since that 
equation holds true for all (to, ^o) G ^+ x C+, we have ft(z) G L(C+) = S for each z G C+, and 

L(f t (z)) = L(z)+tH t (z)=L(z) + tH (f t (z)), VzGC+. (3.25) 

The second equality follows from the statement of Lemma 3.9. It is also worth noting here that the 
domain of definition for the function H t in Lemma 3.9 is C + ; as such, the characteristic z(t) passing 
through zq at time to is necessarily contained in C+, and so ft (zo) = z(0) G C+. In other words, it 
follows from the proof that 

f t (C+) C C+, t > 0. (3.26) 

This can also be seen from the definition of ft, Equation 3.15, at least for small t > 0, using the Taylor 
expansion of M about L(z) together with a small-angle argument in a neighborhood of H in C + ; the 
details are left to the interested reader. 
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Remark 3.11. It is possible to derive the relation H(t, z) = H(0, ft(z)) in the following alternative fash- 
ion: define J (t, z) = Ho(f t (z)). Note that J(0, z) = Ho(fo(z)) = Hq(z). Elementary but laborious 
calculations show that J also satisfies PDE 3.12, and is analytic. It therefore follows from the Cauchy- 
Kowalewski theorem that J = H, as claimed. We prefer the method of characteristics approach above 
for two reasons: the calculations are much shorter, and they avoid some technical difficulties that arise 
differentiating J(t, z) at points z where ft(z) happens to be in [0, 1], outside the known analytic domain 
of H . These difficulties can be overcome with restriction and then analytic continuation techniques 
once continuity is proven; we prefer this more direct approach, which aids in the proof of continuity. 
It is worth noting that points z where ft(z) £ [0, 1] will play an important role in what follows. 

Remark 3.12. One might hope to use the method of characteristics to similarly deduce the existence 
of a subordinator for the solution of the general case, for (a, 0) / {\,\), i-e. (a, 6) / (0,0). The 
same change of coordinates H = \fz\J ' z — 1G transforms the general PDE 1.9 into the /^homogeneous 
semilinear equation 

dH r ._ . . ,,,dH (a + 2b)z-b TT 

The corresponding characteristic equations are 

dz 



— = — 2\fzV z — lw — {az + b), 

dw (a + 2b)z — b 

—r = ; s — w. 

dt 2z(z - 1) 

The inomogeneity generates a fully intertwined system of ODEs that is not explicitly solvable except 
in the special case a = b = 0; this is one demonstration of how the behavior in the case we presently 
consider is better than the general case. 

We next use the fixed-point equation to show that the solution H t is bounded for t > 0, uniformly 
in t away from 0. 

Lemma 3.13. For any compact subset K c C, there is a constant Ck so that 

\H(t,z)\ <max{C K /t,l}, z€KnC + . 
In particular, it follows that, for any 5 > 0, 

sup sup \H(t,z)\ < oo. 

t>5 z£C+ 

Proof. Consider the function H o M. For fixed e > 0, let B e = B 1 / 2+e (l/2) be the open ball of radius 
1/2 + e centered at 1/2 (i.e. a complex open ball containing [0, 1]). Since Hq(z) = \fz\fz — 1Gq{z) and 
Gq{z) = G VQ (z) is the Cauchy transform of a measure supported in [0, 1], Hq is analytic on C — [0, 1] and 
hence is bounded on the closed set C — B e . Thus, Hq o M is bounded on L(C — B e ). Since the function 
L is bounded on compact sets, there is an e-dependent constant R so that 



C - B R (0) C L(C - B e ). (3.27) 

Now, let K be any compact subset of C. Since sup^ \L\ < oo, we can find a constant Ck such that 

Vw G C \w\ + sup \L\ >C K =^ \H o M(w)\ < 1. (3.28) 

K 
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To see why, note that we may first choose Ck larger than R — sup^- \L\; then the set of w in question is 
contained in the set C — B e where Hq o M is bounded. Since we also know that the limit as \z\ — > oo of 
Hq{z) is 1/2, and that |M(io)| — > oo as \w\ — > oo, the estimate follows from the continuity of Hq o M on 
C-B R (0). 

Now, for any compact K C C, suppose there is a z G if — [0, 1] such that |Ht(z)| > Cx/t. Set iu = 
L(z) + fcff t (z). Then 

\w\ = \tH t {z) + L(z)| > t\H t {z)\ - \L(z)\ >C K ~ \L{z)\. 

Therefore 

\w\ + sup \L(z)\ > \w\ + \L(z)\ > C K - 

K 

From Equation 3.28, it follows that \Hq o M(w)\ < 1. But the fixed-point Equation 3.16 for H t then says 
that, for z £ K n C+, 

= K^o o M)[L(z) + tH t (z)}\ = \H Q o M(w)\ < 1. 

Thus, for any z G if n C+, if |i?t(z)| > Cx/t, then |i?t(z)| < 1. This proves the first statement of the 
lemma. For the second, note (as used above) that limui^oo zG(t, z) = vt[0, 1] = \ for any t > 0, and so 
lim| z |_>.oo H t (z) = \. Moreover, the analyticity in t and the convergence to the steady state (Section 3.1) 
shows that this convergence is uniform in t > 5. Thus, there is a compact set K and a fixed constant 
C so that sup t>< 5 sup z( £ K \ H(t, z)\ < C. Combining this with the first statement of the lemma proves the 
second. □ 

With Lemma 3.13 in hand, we now make the following assumption, without loss of generality. 
Assumption 1. iio is bounded. Thus sup t>0 sup 2gC+ \H(t, z)\ < oo. 

This assumption is justified by the semigroup property for the soluotion H t of PDE 3.12: given to > 0, 
the solution H l t ° of the PDE with initial condition H l Q = Ht is equal to H 1 " = Ht+t ■ So, in each of 
the following statements that hold for all t > 0, we may simply begin the proof by selecting some 
to £ (0, t) and proving the theorem instead for t — to > 0, then use the semigroup property; in so doing, 
we translate to initial condition H to which satisfies Assumption 1 by Lemma 3.13. So we may make 
this assumption without loss of generality, and freely do so for the remainder of this section. 

Remark 3.14. It is important to note that, while H t satisfies the semigroup property, its subordinator 
ft does not. Indeed, if we denote fl° as the subordinator corresponding to the solution with initial 
condition H to , then Equation 3.15 yields 

f t °(z) = M[L(z) + tH*°(z)\ ± M[L(z) + (t + t )H t (z)] = f to+t {z). 

We now proceed to demonstrate that the subordinator f t is a homeomorphism. First, we identify 
its range. 

Definition 3.15. For t > 0, define the region Q t e C + as follows: 

Sl( = {weC + : L(w) - tHo{w) G L(C+) = S}. (3.29) 

Lemma 3.16. The map f t is a conformal (one-to-one) bijection from C + onto 



38 



Proof. Let z\,z 2 G C+. If ft{z\) = ft{z 2 ), then by Equation 3.25, it follows that 



L( Zl ) + tH (f t (zi)) = L(f t ( Zl )) = L(f t (z 2 )) = L{z 2 ) + tH (f t (z 2 )). 



But the assumption ft(z\) = ft(z 2 ) then implies that L{z\) + tHo(ft(zi)) = ^(^2) + tHo(ft{z\)) and 
so -£1(21) = L(z 2 ); so zi = ^2 by Lemma 3.8, and thus f t is one-to-one on C + . By Equation 3.25, for 
any z G C+, L(f t (z)) = L(z) + tH (f t (z)) which means that (L - tH ){f t (z)) = L{z) G L(C+). This, 
coupled with Equation 3.26, shows that f t (z) G O t for all z G C + ; and so /t(C+) C Conversely, if 
it; G f^, then there exists z G C + such that L(w) — tHo(w) = L(z). Equation 3.25 shows that w = ft{z) 
is a solution to this fixed-point equation, and moreover the injectivity of L proves that it is the unique 
solution; hence w = ft{z), so w G ft(C + ); and so Qt Q ft{C+). □ 

We will next show that ft extends continuously to a homeomorphism C + — > Uf Were we to follow 
the approach in [5], here we would use the inverse function theorem applied to the putative inverse of 
ft- Indeed, Equation 3.25 states that (L — tHo) of t = L, meaning that the inverse h t of ft, should it exist, 
must he h t = M o [L — tHo). A strictly-positive lower-bound on the Lipschitz constant of h t would 
imply h t extends to a continuous one-to-one map from Qt C+, yielding the corresponding result 
for f t . Unfortunately, this approach is not possible in the present context, as the following example 
illustrates. 



7}(5o + 81). Hence vq = \5\, and so Gq(z) = 
y/z/2y/z— 1. Simple calculation shows that J^[L{w) — tHo(w)] = at the 
t/2, which is in the unit interval for < t < 2; so h' t (wt) = 0. In fact, this point is 
also in d£l t during this time interval. Restricting to t G (0,1) and letting s = 1 — t, we have 



Example 3.17. Suppose no is the Bernoulli measure no 

1/2(2! - 1), and H (z) 
point w = Wt = 1 



L(w t ) 



I In 



1 -t + i^/(2-t)t 



±ln± + ±ln 



s + iyl — s 2 



i In g + \i tan 1 



Also 



Hence 



tH (w t ) = t 



y/l-t/2 



L(wt) — tHo(wt) = i In i + z • i \/l — s 2 + tan 



,-1 VI 



Elementary calculus shows that, for s G (0, 1), this is in | In ^ + i(0, n/2) C L(C + ). Thus wt G and 
h' t (wt) = 0. So there is no strictly positive lower-bound on ||/it||Lip(n t )- 

Nevertheless, f does indeed possess a continuous, one-to-one extension to the boundary. They key 
issue is identifying the nature of dflf 

Lemma 3.18. The boundary dttt is & Jordan curve in C+. 

Proof. To begin, let us note that it suffices to prove the claim for all sufficiently small t > 0, by the 
semigroup property of the solution H t . Consider the fibration of C + provided by the vertical line- 
segments y i-)- x + iy for y > and fixed iel We will prove the following claim. 

Claim. For fixed x G M, if x + iyo G fit then x + iy G fit for any y > yo- 

This suffices to prove the lemma: it demonstrates that fit is the region above the graph of real-valued 
function of x G M. Since fit is open, this function is automatically uper semi-continuous, and therefore 
does not have any oscillatory discontinuities; hence, by appending vertical line segments to any jump 
discontinuities, we see that fit is bounded by a Jordan curve. 
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To prove the claim, we use the convexity of the strip S = L(C+). Indeed, fix two ^-probability 
measures vo, u l/ and let £1® and Sl\ be the regions corresponding to these measures. Fix s G (0,1), 
and set u s = (1 — s)uo + sv\, their convex combination, with corresponding region fi|. Let Hq(z) = 
y/zy/z — lG Vs (z); then 

/■i i 

L(z) - ti?o(-z) = - ivW* 3 ! / [(1 - s)z/ (du) + sui(du)} 

Jo z — u 

= L(z)-t[(l-s)H° (z) + sH 1 (z)] 

= (1 - «)[£(*) - «*<?(*)] + s[L(z) - tHl{z)}. 

Now, suppose z G for j = 0, 1. This means (cf. 3.29) that L(z) - tH J {z) G L(C+) = S. Since S is 
convex, any convex combination of the two points L(z) — tH ] {z), i = 0, 1, is also in 5; thus, we have 
shown that L(z) — tH^z) G L(C + ), and so z G 0| for < s < 1. It follows immediately that, if the 
Claim holds for and for Qj, then it holds for fif for < s < 1. Since every ^-probability measure is 
a weak limit of convex combinations of ^-point masses in [0, 1], it therefore suffices to prove the claim 
only for the special case that the initial measure of the form u = ^5 a for some point a G [0, 1]. 

The Cauchy transform of ^5 a is z \- > 2 {z-a) '> nence / we rnust study the function 

^ , s t f \ \fz\lz— 1 1 , / 1 r- i \ Z — 1 

F t>a (z) = L(z) - i^v = ln l z _ , _ t VV 

2(2 — a) 2 \ 2 / 2[z — a) 

To prove the claim, it suffices to show that, for each x G K, the image of the line segment y \-> x + iy 
under F t%a intersects the boundary of 5 at most once for y > 0. The following facts may be verified by 
elementary calculus. 



(1) For x < a, ^sF tia (x + iy) > for all y > 0. 

dy ' 



(2) For x > a, £-3F ta (x + iy) > for all y > 0. 



(3) For x G [0, 1], ^KF 4ja (x + iy) possesses at most one 0, and is > for large y > 0. 

(4) For x i [0, 1], ^F t)a {x + iy) > for all y > 0. 

Item (1) shows that the image of QF t: a never intersects the lower boundary y = when x < a, and 
item (2) shows that it intersects the lower boundary at most once when x > a. In both cases, since 
ty/zy/z — 1/2(2 — a) — > t/2 as \z\ — > 00, its imaginary part tends to 0; thus for sufficiently small t 
(independent of a), ^sFt >a never intersects the upper boundary y = |. As for KFt ;a , when x G [0, 1] 
note that $tF t (x + iO) = RL(x + iO) = i ln | constantly, and so item (3) shows that the image curve 
y 1 — ^ ^R.F t>a (x + iy) may initially dip below this level and return to intersect the line y = \ ln i once, or 
it may stay above this line for all y > 0; in either case, it intersects the line at most once. Similarly, for 
x ^ [0, 1], ^tF^ a {x -HO) > \ ln i for sufficiently small t > (independent of a), and so item (4) shows 
that in this case ^kF tA {x + iy) > | ln | for y > 0. 

We have thus shown that, for all sufficiently small i > 0, independent of a G [0,1], for any x G R, 
if Ft ia (x + iyo) G L(C+) then F t>a (x + iy) G L(C+) for all y > yo- This proves the claim in the case of 
initial measure \b a , and thence by the above convexity argument, proves the lemma. □ 

This actually suffices to prove the main result: that ft extends continuously (and, in fact, injec- 
tively) to the boundary. This illustrates the general principle that pathological boundary behavior of 
conformal maps is observable in the topology of the image of the boundary. 
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Theorem 3.19. For t > 0, the subordinator f t : C + — > Qt extends to a homeomorphism C + — > U t . 

Proof. Since ft is a conformal map defined on all of C+, and by Lemma 3.18 the boundary of /t(C+) is 
a Jordan curve, the result follows immediately from Caratheodory's Theorem, cf. [22, Thm 2.6]. □ 

Remark 3.20. Example 3.17 demonstrates that the continuous extension of the conformal map f t to M 
need not be smooth: it can certainly have singularities along the line (at the critical values of ff l = h t ). 
We will quantify exactly where such singularities may occur in Lemma 3.27 below. 



Corollary 3.21. For t > 0, the solution H t to PDF 3.12 possesses a continuous extension to (0, 1). 
Proof. Referring to Equation 3.25, we have L(f t (z)) = L(z) + tH t (z) for z 6 C+. thus 

H t (z) = - t [L(f t (z)) - L(z)) . (3.30) 

Theorem 3.19 shows that ft has a continuous extension to C+, whose range is Oj, contained in C+. By 
Lemma 3.8, L possesses an analytic continuation to a complex neighborhood of (0, 1), concluding the 
proof. □ 

Remark 3.22. From here on, we will use H t and ft to refer to the extended continuous maps defined on 
(0, 1). The continuity of all involved functions and the fact that Equation 3.30 holds on C + shows that 
it also holds for the extensions to (0, 1). 



Corollary 3.23. For each t > 0, the measure v t possesses a density p t , which is continuous on (0, 1), and 
satisfies the bound 

C 

Ptix) <-===, a: 6(0,1) 
v / x(l - X) 

for some constant C > independent oft. 

Proof. By Corollary 3.21, the map H t has a continuous extension to (0, 1) (which we also refer to as H t ). 
Thus 

G t {z) = l ^— H t {z) 

yJZyJz - 1 

possesses a continuous extension to (0, 1). The Stieltjes inversion Formula 1.14 in pointwise form then 
shows that, for x 6 (0, 1), 

p t (x) = --QGt(x) = - . 1 KHt(x) (3.31) 

71" 7T y/x(l - X) 

is the density of v t , which is therefore continuous. Finally, Assumption 1 (i.e. Lemma 3.13, together 
with the semigroup property) shows that ||-Ht||z,°o(]R) < oo, and so the stated result holds true with 

C = H-^llL°°(R)/ 7r - □ 

This proves most of our main Theorem 1.5. It also proves the motivating claim that the projections pt, q 
are in general position (in this case a = j3 = 1/2.) 

Corollary 3.24. For any two projections p, q with r(p) = r(q) = 1/2, the free liberation projections pt, q are in 
general position for all t > 0. 
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Proof. Recall from Equation 1.7 that p t and q are in general position if and only if the spectral mea- 
sure fit = fJ-qp t q satisfies ^{1} = max{a + f3 — 1,0} = in this case. The measures \xt and u t dif- 
fer only by a point mass at {0}, so /Uf{l} = ff{l}. Corollary 3.21 shows that the Cauchy transform 
G(t,x) = G Vt (x) satisfies sup zg g- 1 yfzyj 'z — \G Vt (z) \ < oo; Corollary 3.23 shows that z/ t has a con- 
tinuous density pt on (0, 1), with >/l — xpt(x) bounded in a neighborhood of x = 1. Hence, if in fact 
vt{dx) = pt(x) dx+'y 5\(dx) in a neighborhood of x = 1, then it would follow that G Vt (z) = G Pt (z) + j^j 
where \\fz — lG Pt (z)\ is bounded in a neighborhood of z = 1 in C+, which contradicts the claim that 
sn^ ze ^\^/z^/z~^lG Ut {z)\ < oo. □ 

Let us also note one more immediate Corollary of Theorem 3.19, together with Lemma 3.9, that will 
be very useful in Section 4. 

Corollary 3.25. Suppose that vq has a strictly positive density p$. Then the density p t of v t is strictly positive 
for any t > 0. 

Proof. From Equation 3.31, pt(x) is positive at any x £ (0, 1) for which ^R.H t (x) > 0. Equation 3.16 
asserts that H t (x) = Ho(f t (x)). The harmonic function ^RHq is equal to 7Ty/x(l — x)po(x) > on K and 
lim^i^oo H$(z) = g so it is strictly positive for large z £ C+; by the minimum principle, $lHo > in 
C+. Thus, since ft(x) is either in [0, 1] or in C+, the assumption that $lHo(y) > for y £ [0, 1] implies 
that H (f t (x)) > for all x £ (0, 1). Hence p t (x) > 0. □ 

Having Corollaries 3.21 and 3.23 in hand, we may apply the semigroup property as in Assumption 1, 
and so we freely make the following assumption from here forward, without loss of generality. 

Assumption 2. Hq has a continuous, bounded extension to (0, 1), and the initial measure vq possesses 
a density po which is continuous on (0, 1), for which sup^gjj y/ 'x(l — x)po(x) < oo. 

It remains to prove the smoothness claim of Theorem 1.5 for the measure v t . First, we need the 
following results on the behavior of the continuous extension of the subordinator f t on the boundary 
set [0,1]. 

Lemma 3.26. Lett > and x £ [0, 1]. If f t (x) £ [0, 1], then p t (x) = 0, and po(ft(x)) = 0. 

Proof. If ft(x) £ [0, 1], then L(f t (x)) — L(x) is purely imaginary, since !RL = i In i is constant on [0, 1], 
cf. Lemma 3.8. Hence, by Equation 3.30, 

m t (x) = ±X[L(f t (x)) - L(x)} = 0. 

It follows immediately from Equation 3.31 that pt(x) = 0, as claimed. Also, from Equation 3.16 (ex- 
tended to the boundary by continuity) Ho(f t (x)) = H t (x), so the assumption that ft{x) £ [0, 1] shows 
again by Equation 3.31 that po(ft(x)) = in this case. □ 

Lemma 3.27. Let t > and let x £ (0, 1). If pt(x) > 0, then f t is analytic in a neighborhood of x, and 
/ t (x)£C-[0, 1]. 

Proof. First, from Lemma 3.26, since pt{x) ^ it follows that ft(x) ^ [0, 1], as claimed. Now, from 
Theorem 3.19, ft : C + — > Of is a homeomorphism with inverse h t = M o (L — tHo) on Q t - In particular, 
L — tHo is one-to-one on this domain, and so (L — tHo)^ 1 ^ is well-defined on L(C+). What's more, 
since L has an analytic continuation to a neighborhood of (0, 1), and Hq is analytic on C — [0, 1], the 
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fact that L — tHo is one-to-one on all of L(C+) implies that (L — ii?o) ^ is analytic on the complement 
of (L - tH )([0, 1]). For x G [0, 1], if L(x) G (L - tH )([0, 1]) then there is some y G [0, 1] so that 
L(x) = L(y) — tH (y), which is precisely to say that x = M(L(y) — tH (y)) = h t (y), and so y = ft(x). 
Since y G [0, 1], this implies by the first statement of the lemma that pt(x) = 0. Hence, if pt(x) > then 
(L — tHo) -1 is analytic in a neighborhood of L(x), and so f t is analytic in a neighborhood of x. □ 

This completes all the elements needed for the proof of Theorem 1.5. 

Proof of Theorem 1.5. By Corollary 3.23, the v t possesses a continuous density pt with the appropriate 
behavior at the boundary points {0, 1}. Lemma 3.9 asserts that H t (z) = Ho(f t (z)) for z G C + ; the 
continuity of H t and ft on (0, 1) means that this holds for z = x G (0, 1) as well, and so H t {x) = 
Ho(ft(x)) where ft(x) G C+. Let x G [0, 1] be a point where pt{x) > 0. Lemma 3.27 proves that ft is 
analytic in a neighborhood of x, and that ft{x) ^ [0, 1]. Since Hq is analytic on C— [0, 1], the composition 
H t (x) = Hq o ft(x) is therefore analytic in a neighborhood of x. □ 

Thus, vt has continuous density which is analytic except at the boundary of its support. To conclude 
this section, we now prove Theorem 1.6. The main result is the following lemma. 

Lemma 3.28. For t > 0, let Z t = {x G R: pt(x) = 0} be the 0-set of the measure v t . Then Z t = ff 1 (Zo). 

Note: we are explicitly under Assumption 2 here. The semigroup property has been applied and time 
has been shifted some small positive amount so that uq possesses a continuous density po- The result 
does not say that the topology of supp u t in any way resembles the topology of the support of the 
original (unliberated) measure p qvq , which can be any closed set in [0, 1]. 

Proof. Suppose x G Z t/ so pt(x) = 0. Equation 3.31 shows that ^RH t (x) = 0, and Equation 3.16 then 
yields MH {ft{x)) = 0. Now, ftH {y) = <n^y{l - y)p t (y) > for y G R, and Iim kKoo H {z) = \ > 0, 
so RHq > for \z\ sufficiently large in C+. Hq is harmonic in C+, so it follows from the minimum 
principle that M (z) > for z G C+. Thus, 9lH (y) = implies that y G R. Hence, ftH (f t (x)) = 
implies that ft(x) = 0, and so by definition ft{x) G Zq. Thus Z t C ff (Zq). 

Conversely, if ft{x) G Zq, then y = f t (x) G R and 

= p (y) = 1 MH (y) = ^H (y) = KH (f t (x)) = KH t (x) 
iry/y(l - x) 

which gives pt(x) = by Equation 3.31 once more. Hence x e Z t , so ^{Zq) C Z t . □ 

Remark 3.29. It is tempting to conclude from Corollary 3.28 that suppz^ = / t _1 (supp vq). This is not 
generally the case, since ft maps the interior of the support of u t into the upper half-plane C+ . The 
subordinator f t is continuous and one-to-one, but it is not a map from [0, 1] — > [0, 1]. 

We have thus proved Theorem 1.6. 

Proof of Theorem 1.6. The subordinator f t is a homeomorphism. By Lemma 3.28, f t maps Zq onto Z t . 
Thus, the complementary sets supp v§ and supp ar e also homeomorphic. □ 

Lemma 3.28 gives a great deal of information about the flow of the measure: its behaviour is quite 
different from the free heat flow of [5]. In Biane's free heat flow, connected components of the measure 
can merge in finite time, producing a singularity potentially more severe than the original boundary 
singularities. (By the continuity principle, the collision point must stay singular and in the zero set 
of the density for all time, though not in the interior of the support.) This contrasts strongly with the 
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present behaviour: disjoint connected components of suppi^ never merge. In the limit, v t converges to 
the Jacobi measure (the continuous part of the measure in Equation 1.6) which in this case is the shifted 
arcsine law 

whose support is the full interval [0, 1]. Lemmas 3.2 and 3.3, together with Lemma 3.28, show that the 
disconnected support "bumps" of u t all flow at finite speeds into small neighborhoods of the endpoints 
{0,1} as time increases, crowding and eventually disappearing in the limit. 
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4 The Unification Conjecture for Projections 



This section is devoted to the Unification Conjecture for free entropy and information of projections. 
We briefly describe the setting of free entropy and information, and then formulate and prove a special 
case of the conjecture in the present context. The reader is directed to the excellent introduction [30], 
and the papers [29] and [16, 17], for a detailed treatment of the background. 

4.1 Free Entropy, Fisher Information, and Mutual Information 

Let x\, . . . , x n be self-adjoint operators in a Ili-factor (gf, r). For parameters N, m G N and R, e > 0, the 
set of matricial microstates of these operators, denoted Tr(xi, . . . , x n ; N, m, e), is the set of self-adjoint 
N x N matrices X\ , . . . , X n , with norm < R, all of whose mixed (normalized) trace moments of order 
< m are within tolerance e > of the corresponding mixed r moments of x±, . . . , x n . 

T R (xx, . . .,x n ;N,m,e) 

= {li,..,I re eM^(C): \tr N (X h ---X ir )-T(x il ---x ir )\ < e Vr G [m], (ti, . . . , i r ) £ [<} 

where [n] = {1, . . . , n}. The volume of this set (in the metric given by the trace norms) grows or decays 
exponentially in the square of the dimension N. The (microstates) free entropy x( x i , ■ ■ ■ ,x n ) is defined 
to be 

X(xi, ••• ,x n ) = sup inf limsup— ^ log Vol (Tr(xi, . . . , x n ; iV, m,e)). (4.1) 

R>0 m&N,e>0 

oo 1 v 

In the n = 1 case of a single self-adjoint operator x, Voiculescu calculated that the free entropy is equal 
to the logarithmic energy of the spectral measure \i x (up to an additive constant): if n x has a density 

H x (du) = p x (u) du, then 

3 1 f 

x(x) - - - 2 ln27r = / ln\u - v\p x (u)p x (v) dudv = E(p x ). (4.2) 

(If p x is singular, xi x ) = — oo.) For several non-commuting operators x±, . . . , x n , there is no analytic 
formula for the free entropy; although we will see below that, changing the definition as appropriate, 
the case of two projections does afford a closed-form analysis, cf . Equation 4.9. 

Let A, B c £/ be algebraically free unital *-subalgebras; A V B denotes the unital *-subalgebra 
generated by A U B. There is a unique derivation 5a-.b '■ Av B — ^ (Av B) ® (^4 V B) determined by 

5a-.b{cl) = a ® 1 — 1 <8> a a £ A 
6 A -.B(b) = b£B' 

(Uniqueness is guaranteed by the algebraic freeness of A, B; without this assumption, 5a-.b is not well- 
defined. It is important to note that algebraic freeness is not related to free independence in general.) 
The liberation gradient of A relative to B, j (A : B), is defined (if it exists) through integration by parts: 

j(A : B) G L 1 (W*{A V B),t) satisfies 

r[j(A :B)x] = t ® t (5 A :b(x)) , x G AV B. (4.3) 

The (liberation) free Fisher information of A relative to B, if* (A : B), is the square-L 2 -norm of the 
liberation gradient: 

p*(A : B) = \\j(A : B)f 2 = r[j(A : B)*j(A : B)}. (4.4) 



45 



(If j(A : B) does not exist, or exists in L 1 but is not in L 2 , <p*(A : B) = oo.) This definition precisely 
mirrors the conjugate variables approach to classical Fisher information, cf. [30]. As with free entropy, 
free Fisher information can rarely be computed explicitly. One important exception is in the case of 
two projections: if =^ = W*(p, q), then <p*(W*(p) : W*(q)) can be computed directly, cf. Equation 4.7 
below. 

The mutual free information of subalgebras A and B, i* (A : B), is defined in terms of the mutual 
free Fisher information, via the liberation process: 

1 r°° 

i*(A: B) = - / y*(u t Au* t : B) dt (4.5) 

2 Jo 

where u t is a free unitary Brownian motion, freely independent from A V B. This definition is arrived at 
in [29, Section 4] from the classical relation between Information and Entropy. Indeed, if S(Xi , . . . , X n ) 
denotes the Shannon entropy of random variables Xi,..., X n , the mutual information I(X : Y) of a 
pair is defined to be I(X : Y) = —S(X,Y) + S(X) + S(Y). Starting here, and using a microstates- 
free infinitesimal version of free entropy x* > Voiculescu gave a convincing heuristic (modulo continuity 
issues at t = 0, oo) that if i*(x : y) = —~x*{x,y) + x*( x ) + X*{y) then Equation 4.5 should hold for 
A = W*(x) andB = W*(y). 

At present, it is unknown whether % = %* in general, and it is similarly unknown whether the 
heuristic used in [29] to give the definition of i* can be made rigorous. This question (along with the 
claim that x = X*) is known as the Unification Conjecture. 

Conjecture 4.1 (Unification Conjecture). Let x\, . . . , x n and yi,...,y n be self-adjoint operators in a 
factor {sf, t), such that x(xi, x n ), x{yi, y n ), and x{xi, . . . , x n , y x , . . . , y n ) are all finite. Then 

i*(W*(x 1 , ...,x n ):W*(yi,..., y n )) = ~x{xi, ■ ■ ■ , x n , y 1} . . . , y n ) + x(xi, ...,x n )+ x{yi, • • • , Vn)- 

Remark 4.2. One benefit of free mutual information i* , defined in Equation 4.5, is that it is finite in 
many situations where free entropy is not; for this reason, the finiteness assumption on x is needed for 
the statement of Conjecture 4.1 to be plausible. 

As noted in [30] and still true today, the result of Conjecture 4.1 is a long way from where the theory 
is at present. 



4.2 Free Entropy for Projections 



In the special case n = 2 and generators p, q that are projections, the abstract quantities of Section 4.1 
can be described more directly. Following [17] and [29, Section 12], denote 



E n =pAq, Eio = pA 



01 



p 1 - A q, E 00 = p Aq 



and let cty 



where p = 1 — p and q = 1 — q. Define E = 1 — (E 00 + Eq\ + Ei + En] 
r(Eij) for i,j £ {0, 1}. Then Eij (and hence E) are in the center of W*(p,q), and so the compres- 
sion \EW*(p, q)E, r\ EW *( p is isomorphic to the 2 x 2-matrix-valued L°° space of a measure v. the 
representations EpE <H> M p , EqE O M q given by 



M p {x) 



1 




M q (x) 



x 



y/x(l - x) 
1 - x 
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identify the compression as W*(M P , M q ) = L°°((0, 1), v; M^iC)) for a uniquely-defined positive mea- 
sure on (0, 1) with mass ^((0, 1)) = 1 — (aoo + "01 + aio + an)- The trace restricted to EW*(p, q)E is 
then given by 

r(a) = / \x[M*{x)]v{dx). 



o 



In this context, the measure v encodes all the structure of the algebra; we will soon see that v is indeed 
related to the spectral measure of the operator valued projection qpq, our central object of study. In [29, 
Prop 12.7], it was shown that the (liberation) free Fisher information of W*(p) relative to W*(q) can be 
explicitly computed in terms of the measure v. 

Proposition 4.3 ([29]). Suppose aooaii = aio"oi = 0. Suppose that v(dx) = p(x) dx has a density p e 
L 3 ((0, 1), x(l — x) dx). Assume that 

V«oi + «io + aoo + an \ p{x) ^ < ^ (46) 



o V *^ — ^ J 

Let 4> denote the Hilbert transform of p, modified by the atj as follows: 

i/ \ tt I \ , a oi + «io . "oo + "ii 

<p(x) = Hp(x) H 1 . 

x 1 — x 

Then the (liberation) free Fisher information ofW*(p) relative to W*(q) is given by 



<p*(W*(p):W*(q))= [ (j)(x) 2 p{x)x{l - x) dx (4.7) 
Jo 

and is finite. 

Remark 4.4. If r(p) = a and r(q) = /3 as in the foregoing, then the assumption aooan = aio"oi = is 
equivalent to the identifications an = max{a + /3 — 1, 0}, aoo = max{l — a — (3, 0}, a\o = max{a — /3, 0}, 
and aoi = max{/3 — a, 0}. In particular, in our special case a = /3 = \, this assumption is tantamount 
to aij = 0, meaning that E; L j = for i, j = {0, 1}; thus, the assumption is that p and q are in general 
position. If they are not, we cannot expect a simple relationship between the Fisher information and 
the density. Note that, in this case, Equation 4.6 holds trivially. 

With Proposition 4.3 in hand, we can hope to explicitly verify the Unification Conjecture 4.1 for 
the case of two projections. There is a twist, however. Going back to the definition of free entropy in 
Equation 4.1, dimension considerations show that, unless p = q = 1, x{p) = x(q) = x{p> l) — —oo; 
hence, these operators do not fit the statement of the Unification Conjecture per se. Instead, a modified 
version of free entropy for projections, Xproj/ is required. 

Let 9(-^V, k) denote the Grassmannian manifold of rank-fc projections on C N . Any projection in 
9(iV, k) is conjugate to the projection diag(l, 1, . . . , 1, 0, 0, . . . , 0) with k Is; the conjugating unitary U G 
U (N) is only determined up to the block structure of this diagonal projection, and so U is invariant 
under the action of U(k) x U(N - k). This gives an identification of 9(iV, k) ^ U(N)/(U{k) x U(N - k)) 
as a symmetric space. Let ttn± '■ U(N) — > S(iV, k) be the quotient map, and define 

!<5{N,k) = Haar C7(A r) o vr~^. 

Thus 7g(AT,fc) is the unique unitarily invariant probability measure on the Grassmannian of appropriate 
dimension/rank. They key to defining microstates free entropy for projections is to use this measure in 
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place of Euclidean volume: if the rank of the limit projections is not full, then they cannot be properly 
approximated by microstates of full rank. 

Fix projections p\, . . . ,p n . For 1 < i < n, let (fcj(iV))^ =1 be sequences with the property that 
ki(N)/N — > r(pi) as N — > oo. Define r pro j(pi, ... ,p n ; k\(N), k n (N); N, m, e), the projection mi- 
crostates, to be the set of projection matricies Pi, . . . , P n with Pj G 9{N, ki(N)), all of whose mixed 
(normalized) trace moments of order < m are within tolerance e > of the corresponding mixed r 
moments of pi, . . . ,p n . That is, r pro j is 

n 

{(Pi,...,P n ) G Y[S(N,ki(N)): \tT N (Pi 1 ---P ir )-T(p n ---p ir )\ <eVrG [m],(i u . . . ,i r ) G [<}. 

i=l 

Following Voiculescu's remarks in [29, Sect 14], in [16, 17] Hiai and Petz defined the projection free 
entropy as follows: 

Xproj (pi, • • • ,Pn) 

= W f n limsu P^72 ^S^TSfJV^ifiV)) ( r proj(pi,- • • ,p n ;ki(N), . . . , k n (N); N, m, e)) . 

i=i 

The value does not depend on the specific sequences ki(N). Note that, when n = 1, all moments 
tr Ar(P m ) = tr jv(P) = ki(N) and r(p m ) = r(p), so the microstate space in this case is the full Grass- 
mannian, which means that Xproj(p) = for a single projection. For two projections, Xproj can be 
explicitly calculated using large deviations results for projections, cf. [15, Thm 3.2, Prop 3.3]. The result 
is as follows. 

Proposition 4.5 ([15]). Suppose aoo«ii = aioaoi = 0. There is an explicit constant C(a, j3) (where a = r(p) 
and (3 = r(q) as usual) such that 

X P roj(p, q) = \v(y) + ai0 + aQ1 j" log x v{dx) + ail + a °° £ log(l - x) v{dx) - C(a, /3). (4.9) 

Remark 4.6. Following Remark 4.4, in the special case a = (3 = i the assumption aoodii = «io«oi = 
is equivalent to the assumption that p, q are in general position: r(p A q) = 0. In this case a^- = for 
i, j G {0, 1}; moreover, the complicated constant C(a,/3) satisfies C(|, |) = (cf. [17, Equation 1.2]). 
Thus, Xproj (p, q) = |S(i/) in our case. 

Thus, a natural extension of Conjecture 4.1 is to ask that it hold for projections, where the free 
entropy terms on right-hand-side are replaced with projection free entropy terms, x ~^ Xproj ■ We state 
this formally in the case n = 1 as follows. 

Conjecture 4.7. Let p, q be projections in a Yi\-factor. Then 

i*{W*(p) : W*(q)) = -Xproj (p, q) + Xproj (p) + Xproj (?) = -XprojO,?)- (4.10) 

The remainder of this paper is devoted to the proof of Conjecture 4.7, in the special case r(p) = 
r(q) = \, precisely given as Theorem 1.7. 
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4.3 The Proof of Theorem 1.7 

The theorem is stated on page 5; it asserts that Conjecture 4.7 holds true in the special case a = f3 = \, 
under the additional mild assumption that the operator valued angle p qpq = \5q + v qvq has a strictly 
positive density v qvq (dx) = p(x) dx for which y/x(l — x)p(x) is bounded on BL The upper bound is 
guaranteed to hold for the corresponding measure of the free liberation (p t , q) at any t > 0, and so 
it is very natural here. The assumption that the initial density vanish nowhere implies the same for 
pt by Corollary 3.25, and hence, by Theorem 1.5, this liberated density is real analytic for t > 0; this 
allows straightforward justification of the differentiations and integrations to follow. Our method is 
to employ Propositions 4.3 and 4.5 to precisely calculate the two sides of Equation 4.10. Note, as 
explained in Remarks 4.4 and 4.6, these Propositions require the assumption that p, q are in general 
position, which follows from the existence of the denisty (cf. Equation 1.7). 

The idea for the present proof is essentially due to Hiai and Ueda, and is very briefly outlined in 
[17, Sect 3.2]. Using the explicit formulas of Equations 4.7 and 4.9, direct calculation demonstrates 
that ^Xproj(ptj q) = \v*(pt '■ <?)• (This is in line with Voiculescu's original heuristics in defining free 
Fisher information: a version of this equality - differentiating entropy along a Brownian perturbation 
yields Fisher information - does hold for classical entropy and Fisher information.) Integrating this 
using Equation 4.5 defining mutual free information, the fundamental theorem of calculus shows that 
i*(W*(p) : W*(q)) = — lim f ^ Xproj(pt, q)} passing the limit inside x P roj is then justified using the as- 
sumed bound on the density. The technical difficulties arise in the derivative calculations; it is here 
that our main smoothing Theorem 1.5 will play a central role in the analysis. 

To begin, we identify the measure v in Propositions 4.3 and 4.5 with the (continuous part of the) 
operator-valued angle studied in Sections 2-3. 

Lemma 4.8. Let p, q be projections with trace r(p) = r(q) = \, satisfying the assumptions of Theorem 1.7. Let 
(pt,q)t>o be their free liberation. Denote by v l the measure characterizing W*(pt,q), cf. the discussion at the 
beginning of Section 4.2. Then v l = 2v t , the measure of Theorem 1.2 and 1.5. 

Proof. Let v t be as in Theorems 1.2 and 1.5. By Corollary 3.25 and Theorem 1.5, the density pt of v t 
is real analytic on (0, 1). Let ut,vt be the real and imaginary parts of the Cauchy transform of u t : 
G(t, z) = G Vt (x + iy) = ut(x, y) + ivt(x, y). Equation 1.16 identifies the density pt as vt(x, 0) = —-irp t (x), 
and its (scaled) Hilbert transform 4> t = —irHpt is (f>t{x) = Hv t (x, 0) = —ut(x, 0). (We scale the Hilbert 
transform by — tt here to match up to the notation in [17].) Since pt is analytic in (0, 1), Gt\j^ has an 
analytic continuation to a neighborhood of (0, 1). Hence, PDE 3.10 extends to (0, 1), and we have 

|U t (x, 0) = §-%G t (x) = ^ [x(x - l^G^Or) 2 )] = 2^- [x(x - l)u t (x, 0)v t (x, 0)] . 



Thus 



d d 
--^iTpt(x) = 2 q^[ x (x - l)(-<f> t (x)irpt(x))]. 



Dividing by 7r and and writing this in terms of the scaled functions 2p t and 2(f) t yields 

-(2 P i(x)) = —[x(x - l)2&(x) • 2pt(x)]. (4.11) 

This is precisely the differential equation for the density of the measure v l , Equation 3.1 in [17], in the 
case a = j3 = ^ (so ay = for i,j e {0, 1}) under the assumption that measure has a smooth (enough) 
density. Thus, Theorem 1.5 and the Stieltjes inversion formula 1.16 show that 2p t is indeed the density 
of the measure v l ; in other words, v t = 2v t . □ 
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Remark 4.9. This technique may be carried out for general a, /3 to show that the measure v is propor- 
tional to the continuous part of the spectral measure of qpq, provided smoothness may be proved first. 
It should be possible to see directly that the two measures are (up to a proportionality constant) equal, 
since they are both defined naturally in terms of the operator-valued angle qptq and the intersection 
p t A q; however, such a direct identification is not obvious to the authors. 

Henceforth, write p t = 2p t and $ = 2(p f . Thus, Equation 4.11, which is now proved rigorously as a 
consequence of Theorems 1.2 and 1.5 and Corollary 3.25, takes the form 

^ t p\x) = £-[x{x - l)^{x)p\x)]. (4.12) 

We now wish to use Propositions 4.3 and 4.5 to calculate i*(pt, q) and XprojO^i <?); to do so, we must 
verify the integrability condition of Proposition 4.3. 

Lemma 4.10. For t > 0, p t £ L 3 (x(l - x) dx). 

Proof. This follows immediately from the bound of Equation 1.10: 

\\pt\\h{ x (l- x) dx) = J Pt{xfx{l-X)dx 

□ 

We now prove the main comparison that will lead to the proof of Theorem 1.7. 
Lemma 4.11. For t > 0, 

j t X P roj( P t,q) = l<p*(W*(pt) : W*(q)). (4.13) 

Proof. By Corollary 3.24, p t and q are in general position for t > 0. In particular, aij = for i, j G {0, 1}. 
Hence, Proposition 4.5 and Remark 4.6 say that 

1 If 

X P ro){pt,q) = = 2 ^\x-y\p t {x)p t (y)dxdy. (4.14) 

Now, Equation 4.12 implies that 

^p\ x )p\y)) = p^x^iy) + ptiy^/ix) 

= ff(x)^ y - i)4> t (y) P t {y)\ + P\y)^[<x - i)<t>\x) P \x)}. 

Integrating against the logarithmic energy kernel, by symmetry we have 

- j \n\x-y\ — [p t {x)p t {y)}dxdy = - j p t (y) In \x - y\— [x(x - l)4> t (x)p t (x)] dxdy. 

Let fe*(x) = x(x — l)(f) t {x)p t {x). The Hilbert transform interchanges harmonic conjugates; hence, the 
analyticity of p l implies the analyticity of </>*, and so k l is analytic on (0, 1). The dependence of the 
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integrand on t is thus also analytic (cf. Section 2.5), and this easily justifies differentiating under the 
integral sign. We now integrate by parts, first writing the integral as a principal value: 

If d 

= 2 / P (y) ln \ x ~ y\fa, k\x)dxdy 

=- I p t (y) dy lim I / In \x — y\ — k l (x) dx + / hi\x — y\—k t (x) dx) 
2 Jo 40 \Jo dx J y+£ dx J 

=- F pHy)dy\im fine- kHy-e) - F ' dx - In e • kHy + e) - F dx] 

2 Jo 40 V Jo x-y J y+t x-y J 

where the first equality follows from the continuity of dk t {x) / dx. The boundary terms from the inte- 
gration by parts combine to 

fctfy _|_ (A k^(y e) 

lne[k t (y — e) — k l (y + e)] = — elne > uniformly on compact subset of (0, 1) 

since k l analytic (so, in particular, C 1 ). Hence, recombining, 

d . . 1 / f y - e k\x) J f 1 k\x) J \ 

dt 2 J 40 \Jo x-y J y+e x-y J 

i / ri uti 



p t (y)dy (p.v. I ^-^-dx] . 

V Jo x-y J 

Since p l is supported in [0,1], we can write this as 



d 7T f 

j t X P roj{pt,q) = ~ J^p t (y)H{k t ){y) dy. 

Write the kernel as k l (x) = \f x(l — x)4> t {x) ■ y 1 x(\ — x)p t (x); up to factors of ix, these are the real 
and imaginary parts of H t (x) (cf. Equation 3.11) which, by Lemma 3.13, is uniformly bounded. Thus 
k l £ L°° and is supported in [0, 1]; hence k l G L 2 (R). The Hilbert transform is self -adjoint on L 2 (R), cf. 
[25], and so 

d Xvro i { P uq) = -^ I P \y)H{k t ){y)dy = -^ [ H{j?)(y)lf(v)dy. 



dt ^^"^ 2 J R r Kt " v""-' 2_ 
In the proof of Lemma 4.8, we saw that $ = —irHp 1 ; thus, we have shown that 

4x P roj (Pt,9) = \ I ' 4> t {y)k t {y)dy = ]- [ 4> t (y)x(l - x)4> t (y)p t (y)dy. 

at z JR z JO 

By Lemma 4.10, the denisty p l is in L 3 (x(l — x) dx), and so Proposition 4.3 applies. Hence, Equation 
4.7 concludes the proof: 



d_ 
~dt 



X P roj(pt,g) = F tf(t) 2 p t (y)x(l -x)dx = <p*(W*(pt) : W*(q)). 
Jo 



□ 

Corollary 4.12. Ifp, q satisfy the assumptions of Theorem 1.7, then 

i*{W*{p): W*{q)) = -limxproj(p^). (4.15) 
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Proof. Lemma 4.11 shows that t \-> Xproj (j>t,<?) is differentiable for t > 0, with derivative equal to 
^ip*(W*(pt) : W*(q)). Equation 4.4 defining the (liberation) Fisher information ip* shows that it is 
manifestly > 0, and so the function t h-> Xproj(Pt) q) is non-decreasing. Moreover, Theorem 2.1 in [17] 
(the main result of that paper) shows that — Xproj (pt, q) < <P*(W*(pt) : W*(q)) under our assumptions; 
whence 

/■OO /"OO 

- / Xprojfe,?)^ < / <p*(W*(pt) ■ W*(q))dt = 2i*(W*(p) : W*(q)) (4.16) 
J ■/ 

by Equation 4.5 defining i*. Since ¥>*(W*(p) : W*(q)) is finite (by Proposition 4.3 and Lemma 4.10), [29, 
Prop 10.11(c)] guarantees that i*(W*(p) : W*(q)) is finite. Hence, Inequality 4.16 certainly guarantees 
that lim^oo x pro j (pt, q) = 0. Ergo, by Lemma 4.11, 

If 00 f°° d 

i*(W*(p):W*(q))=-J o <p*(W*(pt):W*(q))dt = J ^Xproj(Pt,?)^ 

= lim Xproj {Puq)~ lim x P roj , <?) 

£700 



- lim Xproj (Pt, 



□ 



Thus, in order to complete the proof of Theorem 1.7, it suffices to show that t \-t Xpmj{pt,q) is 
continuous at t = 0. 

Lemma 4.13. Under the assumption of Theorem 1.7, lim^o Xproj (pt, q) = Xproj (p, q)- 

Proof. As 1 1 0, the free unitary Brownian motion u t converges (in the strong operator topology) to the 
identity; hence qptq — >■ gpq, and the measure /^ converges weakly to /io- It follows that the density pt 
converges to p pointwise on (0, 1). Utilizing Proposition 4.5, 

1 If 

X P ro){pt,q) = = 4 / \n-\x-y\p (x)p\y)dxdy. 

Since p t (x)p t (y) — >■ p(x)p(y) for almost every x, y as t 1 0, we need only show that the kernels In \x — 
y\p t (x)p t (y) are uniformly dominated by an ^([0, l] 2 ) function to apply the Dominated Convergence 
Theorem to prove the result. 



By assumption, p(x) < C/y/ x(l — x) for some constant C, and by Theorem 1.5, this will also hold 
true uniformly in t > 0: p t (x) < C / yj x(l — x) for C independent of t. Thus, for x, y G [0, 1] and t > 0, 

C 2 

|ln \x — y\p t (x)p t (y)\ = — In \x - 



y/x(l - x)y/y(l - y) 

This function is indeed integrable: its integral is equal (up to the constant C 2 /it) to — E(Arc) where 
Arc(cfa) = dx/ny/ x(l — x) on [0, 1] is the shifted arcsine law. By Equation 4.2, this value is, up to 
additive constants, equal to — x(Arc), which is known to be finite cf. [28]. □ 
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